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A Statistical Analysis of Bell System 
Building Fires, 1971-1977 


By A. R. ECKLER 
(Manuscript received December 23, 1977) 


About 200 fires in Bell System buildings and adjacent grounds (ex- 
cluding Western Electric) are reported to AT&T each year; the actual 
number of fires that occur may be somewhat higher. The dollar damage 
of reported fires (excluding only the $60 million fire in New York City 
on February 27, 1975) is reasonably modeled above the median by a log 
normal probability density function. This paper introduces a detailed 
taxonomy of fires, showing substantial differences in their frequency 
and costliness. The paper concludes with various special topics: (i) an 
analysis of employee injuries and service interruptions caused by fire; 
(it) the correlation of business hours with fire frequency and building 
occupancy with fire severity; (iit) the methods used to fight fires; (iv) 
an analysis of multiple fires in buildings and of a cluster of fires in the 
Greater New York area in March 1975. 


l. INTRODUCTION 


Fires occurring in Bell System operating company or Long Lines 
telephone buildings or adjacent grounds, but not fires in Western Electric 
plants, are reported to AT&T on a standard form entitled “American 
Telephone and Telegraph Company Fire Report—Buildings” (Form 
E-5000), issued in 1962 and revised in 1969 and 1976. This paper analyzes 
approximately 1500 of these reports, covering fires that occurred in the 
years 1971 through 1977. Although Bell System summary statistics on 
fires go back a decade or more, few reports on fires prior to 1971 are, 
apparently, now available. The issuance of a fire report is governed by 
the following definition of a fire: 
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Any occurrence that produces heat or flame and smoke in tele- 
phone company property or leased space, that affects service, 
causes property or equipment damage, and/or endangers inhabi- 
tants. 

For the purpose of this paper, this has been interpreted to mean that a 
fire is characterized by an open flame, arcing, or sparks, visible smoke, 
or a combination of these; if the fire is out before it is detected, the site 
is marked by ashes, charred areas, or discoloration. Furthermore, an 
explosion is counted as a fire. However, a burning odor unaccompanied 
by smoke that cannot be traced to evidence of the above nature is not 
counted as a fire. All fires on company-occupied premises, either owned 
or leased, are supposed to be reported. Specifically, in this paper a fire 
is included if (i) it begins on non-Bell property and spreads to Bell 
property, damaging it (including water damage by fire fighters) or (ii) 
it occurs in a vehicle parked on Bell premises. However, a fire is not in- 
cluded if (i) it occurs in a PBX, telephone closet, or similar place in a 
building owned or leased by the telephone user, (ii) it occurs in a Bell- 
owned car or truck off Bell premises, (iii) it occurs on adjacent property 
but does not spread to Bell property, (iv) it occurs in Bell System outside 
plant such as manholes and cables, or (v) it occurs in a Bell-owned 
building not being used for telephone purposes and slated for eventual 
demolition and replacement (often, these are vacant but sometimes they 
are rented to tenants for a few months). Car and truck fires off Bell 
premises are recorded on a standard form entitled “American Telephone 
and Telegraph Company Fire Report—Motor Equipment” (Form E- 
5000 ME). 

It is quite clear that reported fires do not represent all the building 
fires in the Bell System under the above definitions. Very small fires, 
such as a lighted match dropped on a carpet and immediately stamped 
out, are rather unlikely to be reported. Also, since fires are relatively rare 
events, employees may not be aware of the reporting procedure to be 
followed. 

The “Report of Abnormal Service Conditions” (Form E-3877), tele- 
phoned to AT&T by operating companies when telephone service is 
threatened or interrupted, includes a few building fires (10 to 15 per 
year). Table I shows that over two-thirds of these fires are also included 
in the fire reports. If service reports and fire reports are filed indepen- 
dently, this yields an approximate estimate of under-reporting of fires. 


Table | — Reports of abnormal service conditions involving 
building fires 
1972. 1973 1974 1975 1976 1977 Total 
Fire report 8 13 5 11 11 7 55 
No fire report 3 3 3 4 7 1 21 
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This paper is restricted to fires reported to AT&T, and the reader should 
keep this potential under-reporting of Bell System fires in mind. Spe- 
cifically, always ask the question: Are reported fires typical of all fires 
with respect to the characteristic under discussion? 

There are several reasons why damage information in the fire report 
should be regarded with caution. These estimates are highly rounded 
(to quantities such as $100, $200, $500); furthermore, they are usually 
made a day or so after the fire, long before the actual bills are in. The 
dollar values presumably reflect replacement costs, and do not allow for 
depreciation; furthermore, there is no indication whether labor costs 
associated with clean-up and repair have been included. If the fire does 
less than $32 or so in damage, there is a strong possibility that it will be 
rounded down to zero; over 30 percent of all fires are so reported. There 
is no reporting mechanism for providing AT&T with more accurate 
follow-up reports of fire costs. However, trends in costs and comparisons 
of different cost distributions should be relatively immune to these 
problems. 


Il. BELL SYSTEM BUILDING FIRE EXPERIENCE 


This section, the core of the paper, summarizes Bell System building 
fire experience. The first two parts analyze year-by-year changes in the 
number of fires per year and the probability density function of fire 
damage for the Bell System as a whole. The third part presents a detailed 
taxonomy of fires, showing which kinds are most frequent or most 
costly. 


2.1 Number of fires per year 


Table II and Fig. 1 summarize the number of Bell System building 
fires (excluding 73 Bell Canada fires, since Bell Canada left the System 
in 1975) which occurred from 1971 through 1977. The upper set of points 
in Fig. 1 includes all fires, no matter how small the damage, but the lower 
set includes only those fires with reported damages of $32 or more. A 
statistical chi-squared test on the Poisson counts! rejects (at the 0.02 
level) the null hypothesis that the average number of fires per year is 
constant; in fact, the figure suggests that there has been a downward 
drift. However, this inhomogeneity can be explained by differential di- 
ligence from year to year in reporting very small fires, for the same test 
on the homogeneity of Poisson counts confirms that more expensive fires 
occur at a constant average rate of a little more than 100 per year. 


Table I! — Bell System fire frequency, 1971-1977 


1971 1972 1973 1974 1975 1976 1977 


All fires 244 209 186 191 194 174 212 
Fires over $32 120 102 97 100 95 108 104 
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Fig. 1—Bell System building fire frequency, 1971-1977. 


Even if inhomogeneities in the fire frequency had been revealed by 
the data, this would not necessarily have been reason for complacency 
or alarm. The Bell System continuously evolves in many ways; for ex- 
ample, the number of operating company employees and construction 
dollars have had year-to-year declines in the 1971-77 period. Further- 
more, AT&T updated its fire protection policies and introduced fire- 
retardant materials in telephone equipment in the late 1960s; the ben- 
efits of these and similar actions have spread through the Bell System 
during the 1971-77 period. Changes in Bell System fire frequency, if they 
occur in the future, are likely to be compncated functions-of many Bell 
System characteristics. 

The important role played by small fires can be illustrated in another 
way. Counting C&P as one company, there are 19 operating companies 
plus Long Lines in the Bell System; it is a straightforward matter to 
calculate the expected number of fires for each of these in 1971-76 under 
the assumption that fires per million square feet per year are constant 
over the Bell System. One can then look at the likelihood of the actual 
counts, based on Poisson distributions having the expected counts as 
their means. It turns out that eight operating companies are in the upper 
10-percent tail of the Poisson distribution (one at the 0.99999971 level), 
and six more are in the lower 10-percent tail of the Poisson distribution 
(one at the 0.0000002 level). However, if only fires greater than $32 are 
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considered, most of this inhomogeneity vanishes; three companies lie 
in the upper 10-percent tail and four in the lower 10-percent tail (two 
companies should be in each tail because of normal statistical fluctua- 
tions). There is strong evidence of differential reporting of small fires 
among operating companies as well as for different years; fires which do 
significant damage (more than $32) are more reliably reported than small 
ones. 

It is somewhat more meaningful to normalize Bell System fire fre- 
quency by relating it to floor area. The overall Bell System fire frequency 
is approximately 0.7 fires per million square feet of floor space per year. 
If the 15 percent of external (and roof) fires are removed from the total, 
the rate is reduced to 0.6. (However, if allowance is made for unreported 
fires as discussed above, the rate is increased to 0.85.) Although quan- 
titative comparisons of this with other industries are hard to ascertain, 
one study by Factory Mutual Research? for the Naval Facilities Engi- 
neering Command divides properties on naval bases into three risk 
categories: less than 1 fire per million square feet per year (communi- 
cations facilities, clinics, electronic data processing facilities, hospitals, 
outside storage, offices, child care centers, schools, vacant buildings, 
mobile equipment, warehouses), between 1 and 3 fires (aerospace 
manufacturing facilities, churches, cold storage plants, laundries, cafe- 
terias, stores, theaters), and more than 3 fires (gasoline stations, barracks, 
clubs, laboratories, utilities and power plants, homes, recreational areas). 
To get a breakdown of Bell System fire frequency by type of space, it is 
useful to subdivide Bell System floor space into analogous categories. 
However, this task is hampered by the lack of centrally compiled sta- 
tistics on Bell System occupancy. Based on rough estimates of Bell 
System floor areas (obtained, in the first three cases, by scaling up New 
Jersey Bell floor areas), the number of reported fires per million square 
feet per year is about 0.3 in switchrooms, 1.0 in power rooms, 0.2 in cable 
vaults, 0.5 in Community Dial Offices, and 1.0 in repeater huts and mi- 
crowave stations. 


2.2 Probability density function of fire damage 


It is useful to summarize the Bell System data on fire damage by 
means of a probability density function characterized by a small number 
of parameters. Because of the extremely wide range of fire damage (most 
are a few hundred dollars or less, but fires exceeding $100,000 have oc- 
curred each year, and the Second Avenue fire in New York in 1975 was 
valued at $60 million), it is necessary to restrict oneself to probability 
density functions in which the independent variable is expressed in 
logarithms. Two common ones exist—the Weibull and the log normal; 
the latter turned out to fit the data quite well and is the one that was 
eventually selected. 
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Figure 2 shows the empirical cumulative distribution functions of fire 
damage for each of the seven years. (The totals do not agree with those 
in Section 2.1 because some fire reports omit damage estimates.) Because 
of the already-mentioned tendency of fire reports to round small-damage 
fires down to zero, only the upper half of the distribution (above the 50th 
percentile) is shown. The ith largest fire damage for a year having n fires 
is plotted at the point (x,y) corresponding to (log dollars, 1 — (i — 1)/n); 
to avoid clutter, only the fires corresponding toi = 1(1) 10(2) 20(5) 50(10) 
100 have actually been plotted. 

In view of the noticeably greater variability of the data in the upper 
tail of the distribution, it was decided to fit a straight line to each cu- 
mulative distribution by a least-squares line fitted to the 50th, 60th, 70th, 
80th, and 90th percentiles listed in Table III. (In this regression, the 
independent variables are the Gaussian deviates of the percentiles, and 
the dependent variables are the fire damages in log dollars.) After it was 
found that the slopes of the six lines did not significantly differ from each 
other, the model was reformulated: seven lines parallel to each other were 
fitted to the data instead. This was accomplished by reducing the 50th 
through 90th percentiles of the 1972 through 1977 data to the 1971 level 
by subtracting the average difference of the damage, as shown in the final 
column of Table III. The common value of the standard deviation (the 
slope) is 1.386 (in log dollars), and the median fire damage is given for 
each year in Table IV. 

It is frequently useful to know the average fire damage as well. Because 
of the skewed nature of the log normal distribution, this is ordinarily a 
much larger value than the median. If m and s represent the mean and 
standard deviation of the log normal distribution in log dollars, the mean 
of the distribution in dollars is given by the formula:® 


M = exp(m log,.10 + (s log. 10)2/2). 


The values of M for the various years are also given in Table IV. 

Why go through this involved procedure to calculate the average fire 
damage when an unbiased estimate of this quantity can be easily ob- 
tained by taking an average of the recorded fire costs? Unfortunately, 
the variance of such an estimate is quite large, for it depends almost 
entirely upon the values of the half-dozen largest fires in the set. The 
estimate M given above is based on m and s, which are far more repre- 
sentative of all the fires in the sample, not just those in the extreme 
tail. 

It is likely that the observed differences in the median fire damages 
from one year to another are, at least in part, due to the inflation in repair 
costs over these years. T’o obtain an estimate of the percentage inflation 
rate, one can fit a straight line to the estimated median fire values (in 
log dollars), using the year as the independent variable. The slope of this 
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Fig. 2—Cumulative distribution function of fire damage, 1971-1977 (number of fires in parentheses). 


Table Ill — Adjustment of fire damage (in log dollars) to 1971 level 


Percentile (Gaussian deviate) 


50 60 70 80 90 
Year (0.000) (0.253) (0.524) (0.842) (1.282) 

1971 1.54 2.00 2.40 2.72 3.37 
1972 original 1.59 1.88 2.30 2.70 3.18 

corrected 1.67 1.96 2.38 2.78 3.26 0.08 
1973 original 1.86 2.18 2.69 3.00 3.79 

corrected 1.56 1.88 2.39 2.70 3.49 —0.30 
1974 original 1.70 2.00 2.40 2.72 3.48 

corrected 1.65 1.95 2.35 2.67 3.43 —0.05 
1975 original 1.65 2.10 2.42 2.88 3.43 
; corrected 1.56 2.01 2.33 2.79 3.34 —0.09 
1976 original 2.00 2.30 2.49 2.95 3.69 

corrected 1.72 2.02 2.21 2.67 3.41 —0.28 
1977 original 1.70 2.18 2.48 3.01 3.70 

corrected 1.49 1.97 2.27 2.80 3.49 -0.21 
Predict value 1.60 1.95 2.33 2.77 3.38 


Table IV — Median and average damage per 
fire for years 1971-1977 


Median Average 
1971 $40 $ 6481 
1972 $33 $ 5390 
1973 $79 $12931 
1974 $45 $ 7272 
1975 $50 $ 7973 
1976 $76 $12349 
1977 $65 $10511 


fitted line is 0.04 in log dollars, or a rate of 10 percent per year, somewhat 
larger than the inflation rate corresponding to the well-known consumer 
price index from 1971 through 1977. 

The 1975 New York fire, the most costly one in Bell System history, 
is not consistent with the log normal distribution. Its $60 million damage 
corresponds to a logarithmic cost of 7.778, which (after subtracting the 
average of the 1971-1977 median fire costs and dividing by the standard 
deviation) yields a standard normal variable equal to 4.37. This translates 
into a probability of only 0.0000062 that a fire randomly drawn from the 
log normal distribution will be this costly; if fires occur at the average 
rate of 200 per year, there is only a 50 percent chance that a fire this 
damaging will occur in 550 years corresponding to 1971-1977 experience: 
(1-0.0000062)550(200) = 0.506. 

Are there any other fires besides the New York one which are not 
consistent with the log normal distribution? Figure 3 depicts the 10 most 
costly fires in the Bell System during 1971-1977, compared with the 1971 
log normal predicted line. To allow for inflation, all fires have been 
translated in damage values to hypothetical 1971 levels using the cor- 
rection factors given in the final column of Table III. Clearly, the log- 
normal model fits all fires but the New York one reasonably well. 


2776 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


yin FIRE EXPERIENCE 


0.999 NEW YORK 1975 FIRE 


0.998 


0.995 


0.99 


CUMULATIVE PROBABILITY 


0.95 








4 5 6 7 8 
FIRE DAMAGE IN LOG DOLLARS 


Fig. 3—Upper tail of fire damage distribution (10 most costly fires, 1971-1977). 


Instead of believing that an event this unusual has occurred, the 
statistician prefers to conclude that the New York fire is a sample of one 
drawn from a probability density function of fire damage different from 
the ones shown in Fig. 2. With such a small sample, it is impossible to 
derive the distribution function of fire damage in the underlying popu- 
lation, nor is it easy to say what fraction of Bell System fires come from 
such a population. Although detailed statistics of the damage distribu- 
tion of 3013 building fires during 1960-1970 are not available, the pattern 
of the most costly fires in that 11-year period ($1.67 million, $0.70 million, 
$0.50 million, $0.40 million, $0.19 million, $0.18 million, $0.15 million, 
$0.14 million, $0.12 million, ...) is consistent with Fig. 2. Therefore, the 
fraction of building fires in the Bell System that do not follow this 
- damage pattern is very small—one out of 3013 + 1483 = 4496 fires in 18 
years! The best estimate of the fraction is 0.0002, and a 95-percent 
confidence interval enclosing the true but unknown fraction is 
(0.0000056, 0.00125). 

It is hazardous to characterize the variability of the cumulative dis- 
tribution function of fire damage based on only seven fitted lines, one 
for each year. The standard deviation of the difference between an ob- 
served percentile and its fitted value is approximately 0.07 in log dollars, 
for percentiles between 50 and 90. However, this is a misleadingly small 
number if one is interested in predicting the fire damage corresponding 
to a specified percentile of the cumulative distribution in a future year. 
As already noted, there is an increase of about 10 percent per year in fire 
costs, but the year-to-year fluctuation of the fitted lines around this 
10-percent rate is considerable. For example, if one attempted to predict 
the fitted median fire damage of 1972 on the basis of the fitted median 
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fire damage in 1971 with 10-percent inflation added, one would have 
overestimated by 0.12 in log dollars—inflation suggested an increase of 
0.04, but in reality the 1972 fitted line was 0.08 below the 1971 one. Other 
year-to-year errors are even larger, and a rough estimate of the standard 
deviation of the error in estimating the following-year fit from the pre- 
ceding-year one is 0.2 in log dollars. The standard deviation of the error 
in predicting an actual percentile in a future year is the root-mean-square 
sum of these two standard deviations, or again about 0.2 (since the 
smaller error is swamped by the larger). In dollars, this corresponds to 
a multiplicative factor of 1.6; thus, if one estimates a future fire damage 
to be (say) $1000, there is only a two-thirds chance that the actual fire 
damage will lie between $600 and $1600. If one is interested in predicting 
extreme percentiles, such as the damage of the largest fire to be expected 
in a future year, the errors are likely to be far larger. 


2.3 Taxonomy of fires 


Bell System building fires can be classified in a number of ways. Table 
V and Fig. 4 present a hierarchical classification in which fires are first 
sorted out by place of origin (under control of Bell System employees, 
or not under their control), then by fire type, and finally by the equip- 
ment in which it originated. Fire type is related to, but not identical with, 
the well-known classification of fires by fire extinguisher type: paper 
fires, electrical fires, oil and grease fires. A fire needs three things to ig- 
nite—oxygen, a fuel, and a source of heat; if the fuel is especially volatile, 
almost any source of heat will do the job, but if it is less volatile, the 
particular source is of greater concern. The following two-level de- 
scription of fire type has been adopted: 

Fires with volatile fuels (oil, gas, gasoline, grease, etc.) regardless of 

heat source. 

Fires with less volatile fuels (paper, wood, insulation, etc.). 

Electrical sparks or short circuits. 

Overheating (placing a flammable substance too near a properly 
functioning heat-producing source, as a chimney or space heat- 
er). 

Heat-generating tools (used too near flammable substances). 

Smoking and matches (whether deliberately set or accidental). 

Classification is not always as simple as this would suggest; for instance, 
electrical malfunction sometimes results in an overheated resistor which 
actually starts the fire (these fires have been classed as electrical). Note 
that fires in certain equipment can appear in several different places in 
Table V or Fig. 4. For example, a fire in a furnace will appear under 
Volatile (Fuel Oil) if this is involved; otherwise, it will appear under 
Hlectrical (Building Equipment, Furnace). Similarly, a fire in a stove 
can appear under Volatile (Grease), Overheating (Stoves), or Electrical 
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Table V — Explanation of fire taxonomy 


External to Bell System: earthquakes, lightning strokes, power wire crosses and surges, 
fires beginning on non-Bell property, water main breaks—but not fires caused by in- 
terruption of commercial power 

Internal to Bell System 
Volatile Fuels 

Fuel Oil: boiler explosions, fuel oil line leaks, oil in cans or on floor 
Gas: explosions of gas furnaces, propane heaters, gas pipeline breaks 
Gasoline: gas pumps, Bell System or employee vehicles located on Bell property—but 
not on assignment away from Bell property 
Tar Kettles: contractor fires 
Hot Grease: grease or fat associated with stoves and grills 
Other: floor sealers, adhesives, calcium hypochlorite, windshield washing solvent, 
butane lighter, anti-static spray, oxygen tank, lacquer thinner, etc. 
Nonvolatile Fuels 
Overheating 
Furnaces, Heaters: gloves on furnaces, cartons stored nearby—not including fires 
of volatiles 
Stoves: papers in vicinity, coffee pot overheating, food bag in microwave oven—not 
including fires of volatiles 
Light Bulbs 
Engine Exhaust: ignition of building structure adjacent to emergency engine ex- 
haust pipe 
Heat-Generating Tools: acetylene and propane torches, soldering irons, grinding 
wheels, Cadwelders (including hot solder deposited in waste containers)—not in- 
cluding fires involving volatiles 
Smoking, Matches 
Trash Containers: ashtrays, wastebaskets, janitor carts and bags, scrap wire bags, 
rubbish rooms, trash compactors 
Loose Paper: fires in paper or wood scraps not in trash containers (often regarded 
as due to arson) 
Mops, Cloths: fires in janitorial closets caused by mops picking up smoldering 
cigarettes 
Chairs, Beds, Drapes: fires in upholstered furniture in lounges or quiet rooms (often 
attributed to smoking) 
Cable Well Bags 
Paper Records, Cartons: a heterogeneous category including fires in paper supplies, 
books, bulletin boards, etc. 
Nonpaper Supplies: fires in stored telephone supplies containing no obvious paper 
or cardboard (often regarded as arson, as ignition by cigarette is not easy) 
Outside Fires 
Trash Containers: truck-away containers in parking lots; piles of loose lumber or 
trash associated with construction activity 
Grass, Shrubs 
Vehicles, Telephone Equipment: night deposit boxes, cartons stored outside, 
employee cars, cable reels, plastic conduit (often regarded as arson) 
Construction Activity: miscellaneous fires in construction areas not obviously 
associated with volatiles (tar kettles), trash piles or heat-producing tools (usually 
attributed to smoking) 
Construction Supplies, Roofing 
Electrical 
Building Power 
Vaults, Transformers: commercial power entrance facilities, including transformer 
vaults and entrance ducts 
Panels, Electrical Closets: main commercial power switchboard, and branches 
terminating in panel boxes (wall-mounted, sometimes in separate closets) 
Local Wiring: fires in distributive wiring of commercial power, including plugs in 
sockets—but not fires in appliances or known to be in fluorescent lights 
Building Appliances 
Fluorescent Lights: defective ballasts 
Local Air Conditioning: window air conditioners or free-standing room units, hu- 
midifiers 
Heaters: either portable or wall-mounted types 
Fans: ceiling-mounted exhaust fans, pedestal fans, portable fans 
Building Equipment 
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Table V (cont) 


Central Air Conditioning: fan motors, compressors, condensers, chilled water 
pumps 
Elevators: motors, control circuits (including dumbwaiters) 
Furnace: fires not obviously associated with volatiles, including mechanical failure 
also (belt leaving sheave) 
Other: air dryers, vacuum pumps, motor controllers and control centers, ultrasonic 
cleaning machine, electric toilets, fire pump control cabinets, garage air com- 
pressors, portable battery chargers, sump pumps 
Food Appliances: coffee pots, stoves, portable defrosters, refrigerators and freezers, 
water coolers, sandwich and vending machines—not including fires attributed to 
volatiles or overheating 
Office Appliances 
Copiers: fires in office copiers, including ones which are attributable to paper jams 
as as electrical malfunction (source often difficult to determine from re- 
port 
Computers, Calculators: desk calculators, computer processing equipment 
Teletype: those not directly associated with switching equipment 
Other: offset press, envelope inserter, CRT service order machine, enclosing ma- 
chine, assignment wheel, typewriter, conveyor belt motor 
Automobiles: fires not associated with gasoline 
Telephone Equipment 
Cable Vault: fires in cabling (usually in open splices) 
Power Room Equipment 
Emergency Engine: load boxes, alternators, start motors, switches and related 
controls—not fires caused by overheating of exhaust duct 

Battery: electrolytic leakage or cell overheating, and fires in associated cir- 
cuitry 

Generator: in motors or associated control circuitry 

Rectifier: includes converter and inverter fires 

Power Plant: fires in 1380-volt power panels, or in general power controls such 
as the 412B power plant or Uninterrupted Power Source equipment 

Cabling: principally in DC power cables, and often due to craftsman error 
Switchroom Equipment 
Main Distributing Frame: usually fires in open splices 
Test and Operator Boards 
ESS Switchers: includes TSPS, and closely associated equipment such as tele- 
typewriters 

Carrier: primarily N and T carrier, and often in unattended remote locations 

Radio: includes mobile radio, and often in unattended remote locations 

EM Switchers: includes closely associated equipment such as teletypewriters; 
fires usually in relays, markers, fuses, step-by-step switches, line finders, 
etc. 

Other: fires in auxiliary equipment, and incompletely identified switchroom fires 
(most of these probably associated with EM switching) 


(Food Appliances), depending upon the nature of the fire. Note also that 
this taxonomy cannot be used to determine the number of fires that occur 
in a given type of Bell System space (switchroom, power room, utility 
room, cafeteria, hall, etc.); in general, a wide variety of different fires can 
occur in a given location. 

In Table V and Fig. 4, 67 building fires occurring in Bell Canada from 
1971 through 1974 are also included to provide as large a statistical base 
as possible. 

Figure 4 depicts the relative frequency of different types of Bell Sys- 
tem building fires. The relative seriousness of these different types is 
presented in Figs. 5 and 6, in which the damage of each fire type is plotted 
(on triangular graph paper) according to a trinomial probability density 
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Fig. 5—A trinomial distribution of fire damage. 


function: the fraction of fires less than $32, the fraction of fires between 
$32 and $320, and the fraction of fires above $320. (These limits were 
selected because they divide all Bell System building fires into ap- 
proximately equal parts.) Thus, points plotted near the lower left corner 
of Fig. 5 or 6 correspond to fires with typical damage under $32, and 
points plotted near the lower right corner, to fires with typical damage 
over $320. 

By using Figs. 4, 5, and 6 in concert, one can learn a great deal about 
the impact of different fires upon the Bell System. For example, inside 
trash container fires are relatively common (207 fires in seven years), 
but rarely cause much damage (73 percent under $32); on the other hand, 
battery fires are considerably rarer (19 fires in 7 years) but are far more 
costly when they do occur (53 percent over $320). 

If fire frequencies are examined year by year for each of the categories 
in the taxonomy, few patterns of interest emerge. However, the decline 
in fires related to construction and installation activity is noteworthy. 
Although overall construction dollars discounted for inflation have re- 
mained in a narrow range from 1971 through 1977 ($5.6 to $6.8 billion, 
in 1967 terms), construction-related fires have declined in every year, 
from 44 in 1971 to 7 in 1977. The four most relevant categories are listed 
in Table VI. 
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Table Vi — Decline in fires related to construction activity 


1971 1972 1973 1974 1975 1976 1977 


Outside fires (construction activity) 6 1 3 0 0 1 0 
Outside fires (construction material) 6 3 2 2 0 1 0 
Heat-generating tools 24 20 18 15 10 8 5 
Power room equipment (cabling) 8 12 2 2 5 3 2 

Total 44 36 25 19 15 13 7 


lll. SPECIAL STUDIES OF BUILDING FIRES 

This section of the paper shows how the information on the fire report 
form can be used to carry out various studies involving specific aspects 
of Bell System building fires. In particular, four topics are introduced: 
(i) an analysis of employee injuries and service interruptions caused by 
fires, (it) the correlation of business hours with fire frequency and the 
correlation of building occupancy with fire severity, (ii) the methods 
employed in fighting Bell System building fires, and (iv) an analysis of 
correlated fire events in buildings, and the cluster of fires in the New 
York City area in March 1975. These topics are meant to be illustrative 
rather than exhaustive; others could easily be developed. 
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3.1 Definitions of fire damage 


Dollar loss is, perhaps, the most well-known definition of fire damage, 
but it is not the whole story. In the Bell System, two other measures may 
also be appropriate: 

(t) With the strong Bell System emphasis on safety on the job, it is 
of interest to study injuries to employees occurring as the result of 
fires. 

(it) With the strong Bell System emphasis on reliable service to the 
customer, it is of interest to study service interruptions occurring as the 
result of fires. 

Of the 1483 fires between 1971 and 1977 (including Bell Canada) for 
which reports were filed, one fire resulted in the death of a contract 
(non-Bell) worker servicing an air-conditioning system and the injury 
of three Bell System employees, and 19 other fires resulted in the injury 
of a total of 5 contract workers and 16 Bell System employees. There are 
several possible reasons for the relatively high injury rate of contract 
employees: They may typically work with more volatile substances (eight 
of the fires, including four of the five fires involving contract personnel, 
were characterized by explosive ignition of highly volatile liquids or 
gases), and they are less likely to be well-instructed in safe working 
procedures and well-motivated to follow them than Bell employees. 

These injury statistics can be put in broader perspective by comparing 
them with the 1976 estimates of fire injuries and deaths throughout the 
United States prepared by the National Fire Protection Association.* 
There were 2.94 million fires in the United States resulting in 108,000 
injuries and 8,800 deaths; if Bell System fire experience was comparable, 
the 1483 reported fires would have resulted in 55 injuries (instead of 24) 
and 4.5 deaths (instead of 1). Looking at the data from a different per- 
spective, 600,000 operating company and contract employees working 
8 hours per day, 5 days per week, have 1/1470 of the potential exposure 
to fire of 210,000,000 United States residents living 24 hours per day, and 
therefore should incur 73 injuries and 6 deaths per year (instead of 3.4 
and 0.1, respectively). Even though the environment and characteristics 
of United States residents and Bell employees are markedly different, 
it is clear that the Bell System has an excellent safety record with respect 
to fires when they do occur. 

Service interruptions due to building fires, although rare, are slightly 
more frequent than injuries. Interruptions can broadly be divided into 
two classes: 

(i) Fires that destroy interoffice trunk circuits resulting in possible 
delays caused by increased congestion on alternate routes. 

(ii) Fires that deny service to individual telephones. 

It can be argued that the latter loss is of much greater importance to the 
Bell System; as long as the delays are not large, subscribers may not even 
be aware of the former impediment. 
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Clearly, some buildings are more likely than others to be the site of 
fires resulting in service impairment; garages and office buildings have 
little or no likelihood of this. More specifically, some areas of a central 
office are more vulnerable than others to service-impairing fires—the 
areas of greatest risk are those containing individual subscriber lines 
(cable vault, main distributing frame, first stage of switcher) or those 
which contain unduplicated equipment. 

If a fire denies service to individual telephones, a rough measure of 
its impact is the number of days of lost service multiplied by the number 
of exchanges affected, called exchange-days for brevity. (An exchange 
can contain up to 10,000 lines, not including extension telephones or 
PBXs, but the number of assigned lines in a typical exchange will be 
considerably less.) Fire reports do not call for information on service 
interruption, so that more precise measures than exchange-days are hard 
to calculate; often, this is only a rough estimate. Table VII lists the most 
serious service-impairing fires (as measured by exchange-days) en- 
countered between 1971 and 1977. In addition to the 22 fires in this table, 
20 more fires affected trunks, principally carrier and radio circuits, for 
various lengths of time. 


3.2 Some relationships between fires and people 


Two truisms associated with fires are: (i) fires are at least in part 
caused by human activity, and consequently are more frequent during 
those hours that a building is occupied, and (ii) fires are less costly if they 
can be detected and fought quickly. Thus, one expects a few costly fires 
at night or on weekends (or at unattended buildings, such as Community 
Dial Offices or repeaters), and numerous but inexpensive fires at at- 
tended buildings during business hours. To what extent do the data 
support these truisms? 

Table VIII shows there is a mild (but statistically significant at the 
0.002 probability level, using a chi-squared test of goodness of fit) 


Table Vil—Building fires which impaired service to individual 
subscribers 1971-1977 


Feb 27, 1975 New York, N.Y. 270 exchange-days 
Nov 10, 1971 New York, N.Y. 1.5 exchange-days 
Feb 11, 1971 Long Island City, N.Y. 1.0 exchange-days 
May 19, 1973 Peekskill, N.Y. 0.5 exchange-days 


In addition, there were 18 fires in operating companies which resulted in less than 0.1 
exchange-days of service impairment. 


Table VIII — Frequency of building fires by month 


January 116 April 139 July 133 October 132 


February 134 May 116 August 147 November 99 
March 144 June 105 September 112 December 114 
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NUMBER OF FIRES 


Table IX — Frequency of building fires by day of week 


Saturday 122 Monday 236 Thursday 282 
Sunday 96 Tuesday 240 Friday 253 
Wednesday 252 


seasonality to fires, with maxima in March and August and minima in 
June and November. Table IX demonstrates that there is a strong dif- 
ference in fire incidence between business days and weekends; weekend 
fires occur less than half as frequently. However, there are no statistically 
significant differences among the different weekdays. Finally, Table X 
and Fig. 7 exhibit a strong relationship between fire frequency and the 
time of day, with a minimum around 5 a.m. and a broad maximum 
around noon. Note that the fire incidence rises steeply in the morning, 
but falls off much more gradually at night. This function follows fairly 
closely the number of on-premise employees (including contract labor), 
with a delay factor to allow for the fact that a certain number of fires 
smolder awhile before being discovered. 


Table X — Time of day of discovery of building fires 


Night and Morning Afternoon and Evening 
12-1 30 6-7 30 12-1 97 6-7 74 
1-2 24 7-8 47 1-2 98 7-8 56 
2-3 26 8-9 77 2-3 102 8-9 55 
3-4 24 9-10 100 3-4 84 9-10 52 
4-5 16 10-11 96 4-5 108 10-11 43 
5-6 23 11-12 105 5-6 70 11-12 37 
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Fig. 7—Frequency of fires as function of time of day. 
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From 1971 through 1977 (Bell Canada fires 1971-1974 only), a total 
of 159 fires occurred in buildings unoccupied at the time the fire broke 
out. Of these, 44 fires self-extinguished, 4 were put out by sprinker sys- 
tems, and 1 by a Halon system. (In one of the remaining fires, a sprinkler 
system inside the building was activated by the heat of an external fire, 
but it played no role in putting the fire out.) Fitting a log probability 
density function by a least-squares line as described in Section 2.2 (to 
the 50, 60, 70, 80, 90, and 95 percentiles) the estimated median fire 
damage turns out to be 2.88 ($630), and the slope is 1.316 (in log dollars). 
In other words, the median damage of a fire in an unoccupied building 
is approximately 10 times as large as the median damage of a fire in an 
occupied building. Put another way, out of the 26 Bell System fires of 
at least $50,000 damage, 15 were in unoccupied buildings; this supports 
the statement that about half the Bell System building fire damage oc- 
curs in unoccupied buildings. 

Although Tables [X and X strongly suggest that fires occur less often 
in unoccupied buildings than occupied ones, it is difficult to establish 
a causal relationship. Telephone demand (and hence electrical activity 
in telephone central offices) also reduces at night and on weekends; could 
this result in fewer telephone equipment fires and building power fires 
as well, regardless of the number of people present? Furthermore, it is 
impossible to estimate the average number of fires per million square 
feet of unoccupied space per year unless one knows the areas of Bell 
System buildings as a function of their occupancy: f(x) square feet oc- 
cupied x or more hours per week, for 0 < x < 168. Unfortunately, these 
data are not available from the operating companies, and it would take 
a substantial effort to generate f(x) for the 20,000 or more buildings in 
the Bell System. If f(x) were known, it might be possible to predict the 
effect (with respect to fire) of such actions as dispersing Bell System 
switching equipment into a large number of small unmanned central 
offices close to the subscriber instead of a few larger central offices at 
a greater distance. 

It would be desirable to extend this study to see if there is any differ- 
ence in fire severity as a function of the distance to the nearest person 
in occupied buildings. Unfortunately, the fire report form does not give 
such information; however, the crude analysis given in Table XI may 
be suggestive. These figures should be interpreted with considerable 
caution, because the mixture of fires may not be the same; for example, 
fires detected by electrical means are likely to be expensive equipment 
fires, whereas fires detected by smoke or odor are likely to include a large 
number of inexpensive trash fires in addition to equipment fires. Thus, 
the fact that the median damage for fires detected by equipment or 
smoke alarms is greater than the median damage of fires detected by heat 
or odor should not be regarded as a demonstration of ineffectiveness of 
the former. 
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Table Xl — Median fire damage in occupied buildings 1971-1975 


Number Median 


of Damage 

Fires (Dollars) 
People in other room, fire detected by equipment or smoke alarm 92 200 
People in other room, fire detected by odor, noise, or light 205 30 
People enter room in which fire is located (for other reasons) 154 20 
People already in same room as fire 320 10 


3.3 Methods used to fight building fires 


During the 1971-76 period, Bell System regular or contract employees 
took action with respect to 1088 fires, either by fighting it themselves, 
calling the fire department, or both; this represents 86 percent of all 
building fires that occurred in that period. Fires in which employees were 
not involved are of two types: (i) those detected by outsiders who called 
the fire or police department or who (in one instance) extinguished the 
fire themselves, (ii) those detected by telephone people which were al- 
ready out, or which self-extinguished before any action was taken (and 
the fire department was not notified). 

Table XII shows that certain occupational groups—inside craft and, 
to a lesser extent, office worker and building mechanic—are the ones 
most likely to deal with Bell System building fires. Inside craft includes 
occupational titles such as switchmen, powermen, splicers, combina- 
tionmen, test deskmen, framemen, and central office maintenance; office 
workers include clerks, stockmen, cafeteria workers, service represen- 
tatives, engineers, and other white-collar occupations, including man- 
agement above supervision; building mechanics include titles such as 
building engineers, watch engineers, building maintenancemen, elevator 
mechanics, building electricians, and building technicians. Note that 
three occupations—construction, janitor, and guard—are likely to in- 
clude substantial numbers of contract employees. The occupation was 
not specified in 12 percent of building fires. 

Fires can be fought in many different ways, and these are summarized 
in Table XIII. Informal methods ordinarily involve blowing out or 


Table XIl — Distribution of Bell System occupations fighting fires 


Occupation Fraction of Fires 
Auto mechanic 0.02 
Janitor 0.06 
Office worker 0.14 
Inside craft 0.41 
Operator 0.03 
Guard 0.04 
Western Electric 0.04 
Construction 0.08 
Install/repair 0.04 
Building mechanic 0.14 
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Table XIll — Fraction of fires in which various fire-fighting 
methods were used 
Fire Department 
Not Call Call, not Need Call and Need 


Informal methods 0.16 0.05 0.01 
Extinguisher or hose 0.42 0.14 0.07 
Call fire department only - 0.03 0.12 


smothering the fire, throwing a glass (or a pail) of water on it, or turning 
off the electricity. The fire department was considered to be called and 
needed if they played a significant role in putting out the fire; if the fire 
was out (or almost out) when they arrived, and they assisted only in 
clean-up or smoke evacuation, they were considered to be called but not 
needed. A substantial fraction of fires are put out with the aid of an ex- 
tinguisher and no notification of the fire department; in only 20 percent 
of all fires was the fire department really necessary. These fractions 
remain much the same for paper fires or electrical fires, but volatile fires 
are less likely to be fought using informal methods without notifying the 
fire department (0.04 instead of 0.16), and more likely to involve extin- 
guishers combined with an unneeded fire department (0.21 instead of 
0.14) or a call to the fire department with no attempt to fight the fire (0.24 
instead of 0.15). : 

Bell fire-fighters using extinguishers or hoses almost always select the 
proper tools for the job; in only 13 cases was water apparently used (in 
whole or in part) on an electrical or a volatile fire. On the other hand, it 
is worth noting that women are infrequent users of extinguishers or 
hoses; out of 682 fires in which these tools were used, only 18 (about 2.6 
percent) involved women. 

Figs. 8, 9, and 10 (all plotted on triangular graph paper) give a more 
detailed look at the different fire-fighting exposures and techniques 
encountered by various occupations. Not surprisingly, auto mechanics 
and (to a lesser extent) installation and repair personnel (based at 
garages) encounter far higher percentages of volatile fires than the other 
groups; inside craft and Western Electric encounter more electrical fires 
than others, whereas janitors and (to a lesser extent) office workers, 
construction workers, and guards encounter paper fires. Operators and 
guards are far more likely to call the fire department than fight the fire, 
but inside craft, Western Electric and (to a lesser extent) construction 
workers, janitors, and building mechanics are unlikely to do so. Fires 
encountered by automobile mechanics, guards, or installation and repair 
personnel are the most likely to require professional assistance; fires 
associated with Western Electric or inside craft, the least. 
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Fig. 8—Occupational exposure to fighting paper-electrical-volatile fires. 


3.4 Correlated fire events 


Many Bell System buildings have had more than one fire in the 
1971-76 time period; in fact, one building has had 10. Statistical methods 
can be used to assess multiple fire events, to determine whether they are 
attributable to statistical fluctuations of fires occurring independently 
and at random, or to correlated events between fires. Correlated fires 
can arise for various reasons; the most common one is arson, but an 
undiagnosed electrical fault can lead to repeated occurrences of fire as 
well. 

There are two distinct ways in which the possible correlation between 
fires at a given building can be examined. First, if x fires have been ob- 
served, one can ask if they cluster in a small period of time, rather than 
spreading out over the entire period. Second, one can ask whether x fires 
is excessive for that building, given Bell System fire experience. This is 
a somewhat more difficult assessment, for one must decide how to nor- 
malize the building with respect to the Bell System. Floor area is the most 
plausible candidate, but in view of the relationship between fires and 
people exhibited in Section 3.2, the number of people in the building may 
be a better normalization. In any event, one must be quite cautious in 
deciding whether or not a given Bell System building is more fire-prone 
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Fig. 9—Fire-fighting techniques used by various occupations. 


than others, simply on the basis of a larger-than-average number of 
fires. 

If x fires have occurred in a building, the cumulative probability 
density function of the smallest time-spacing between any two consec- 
utive fires can be written® 


Prob(minimum spacing < t) = 1 — (1 —(n — 1)t)", 


where n is the total number of fires and ¢ is normalized with respect to 
the total time-interval (for example, if two fires occur 10 days apart in 
a 6-year period, t is equal to 10/2192, or 0.00456). If fires occur inde- 
pendently, and at random at a building throughout the time interval, 
this probability is distributed uniformly between zero and one; on the 
other hand, if there is correlation between fires (the occurrence of a fire 
raises the chance of another fire occurring in the near future), then there 
will be an excessive number of small values of the probability. Table XIV 
summarizes the probabilities associated with all multiple-fire buildings. 
The right-hand column clearly indicates that there are more buildings 
with small probabilities than with large ones; a chi-squared test of 
goodness-of-fit confirms that this result is not explainable by random 
fluctuation (at the 0.0000001 level). 
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Table XIV — Distribution of minimum time-interva! probabilities 
for Bell System buildings with two or more fires, 1971-1976 


Probability (minimum spacing St) Number of Buildings 

0-0.1 44 (17) 
0.1-0.2 26 (21) 
0.2-0.3 17 (15) 
0.3-0.4 17 
0.4-0.5 10 
0.5-0.6 15 
0.6-0.7 8 
0.7-0.8 8 
0.8-0.9 10 
0.9-1.0 ‘9° 

164 (130) 


However, the inflation in probabilities does not extend beyond 0.3, 
for a similar chi-squared test on the last seven values yields a value of 
only 6.91, significant at the 0.3 level. 

If one examines the fire reports for those 87 buildings for which the 
probabilities are less than 0.3, it is not hard to identify pairs of fires that 
appear to have some common factor. If these 34 buildings (20 percent 
of all multiple-fire buildings) are subtracted from the total, the paren- 
thesized values in Table XIV result, and the corresponding chi-squared 
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test of goodness-of-fit is far more plausible under the hypothesis of 
randomness and independence: it is 14.46, significant at the 0.11 level. 
In other words, it is possible to detect most, if not all, of the correlation 
in multi-fire buildings by a reading of the fire reports; there does not 
appear to be much additional correlation present. 

Most of the correlated events in the 34 buildings removed from the 
analysis are arsonous in nature; only six appeared to have other 
causes: 

(t) A fire in the power panel of the turbine room of one building oc- 
curred within 10 minutes of a fire in the turbine sensing unit in a 
neighboring building. 

(it) Two fires 14 days apart occurred in the AC busway serving an ESS 
office. 

(tit) An electrical fire in aisle 31 on the third floor of a mobile radio 
center was followed 98 days later by an electrical fire in aisle 32 on the 
same floor. 

(tv) Two fires 26 days apart were caused by careless use of a cutting 
torch during modification of a building by a contractor. 

(v) A fire in a coffee urn in a ladies’ lounge was followed by a fire in 
a stove in the same lounge 10 days later. 

(vi) A fire in a 48-volt generator in a power room was followed 21 days 
later by a fire in a 24-volt generator in the same room. 

Unfortunately, this statistical technique cannot be used to identify 
arson in Bell System buildings if the arsonist acts only once; many such 
fires can be effectively made to look like accidents (for example, a ciga- 
rette carelessly thrown into a wastebasket). It is only when the arsonist 
strikes twice within a reasonably short period of time (say, six months) 
that his presence is almost always suspected. 

In principle, a similar statistical analysis could be performed on the 
number of fires in each building in the Bell System normalized with re- 
spect to floor area (or other indicator of size or activity); however, floor 
area data on the 20,000 or more buildings in the Bell System is widely 
dispersed and not readily available for analysis. To give some flavor of 
the possible calculations, Table XV presents statistics on all buildings 
in the Bell System having eight fires or more during 1971-1976. The 


Table XV — Buildings in the Bell System with eight or more fires, 


1971-1976 
Floor Area Obs. Exp. Pr(fires 

(thous. sq. ft.) Fires Fires = observed) 
Fresno, Cal. 179 10 0.76 0.00000001 
Detroit, Mich. 769 9 3.25 0.0063 
Manhattan, N.Y. 516 8 2.18 0.0019 
Bronx, N.Y. 328 8 1.39 0.00010 
Washington, D.C. 167 8 0.71 0.0000008 


Bell System: 1194 fires, floor area 282.1 million sq. ft. 


BELL SYSTEM BUILDING FIRE ANALYSIS 2793 


expected number of fires in Table XV is calculated by multiplying the 
total fires in the Bell System by the ratio of the building floor area to the 
Bell System floor area; the probability of observing this number of fires 
or more, given the expected number, is calculated by means of the 
Poisson probability density function 


Pr(x or more fires) = D exp(—A)A/i!. 


In four of these buildings, a reading of the fire reports clearly points to 
an arsonist at work. The fifth building, in Detroit, has no obvious pattern 
of fires, but it is likely that an event of probability 0.0063 could have 
occurred by chance, given the large number of Bell System buildings. 
(In a list of 20,000 buildings, there is a 50-50 chance that the probability 
in the final column of Table XV will be less than 0.000025 in at least one 
case, even assuming all the buildings have the same underlying fire 
propensity per square foot. To assess the correctness of this assumption, 
one would have to look at the probabilities associated with all of the 
buildings.) Furthermore, the minimum spacing between any pair of fires 
in the Detroit building (12 days) is not unusual; a random sample of 9 
fires will produce a shorter minimum spacing 33 percent of the time. 

The typical fire in the Bell System receives very little publicity; usu- 
ally, only a few of the workers in the building know about it. (Since 58 
percent of all fires do not involve the fire department, newspaper cov- 
erage is likely to be sparse.) It is of interest, therefore, to assess the impact 
of a Bell System fire which generated enormous publicity in a metro- 
politan area—the February 27, 1975 fire at 204 Second Avenue in 
Manhattan. Among other things, newspapers reported a rash of fires in 
other telephone buildings in the area during the month that repairs were 
being made; it was suggested that the fire publicity might have encour- 
aged latent arsonists elsewhere in New York Telephone Company. 

One can examine the fire data to see whether or not such an allegation 
is true, or whether the number of fires that occurred in the next month 
can be explained as a not-untypical fluctuation in the pattern of fires 
over the entire seven years. Table XVI gives the number of months (out 
of 84) in which 0, 1, 2,--- fires were observed in Manhattan, in all five 
boroughs of New York, and in the Greater New York metropolitan area 
(specifically, all fires in the five boroughs, in the Nassau and Westchester 
operating areas of New York, and in the Essex, Raritan, and Hudson 
operating areas of New Jersey). The “Obs” column gives the actual 
number of months that the indicated number of fires were observed, and 
the “Poi” column gives the expected number of fires if a Poisson prob- 
ability density function is fitted to the data 

The one-parameter Poisson distribution does not fit the data partic- 
ularly well; in fact, a chi-squared test of goodness-of-fit rejects the model 
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Table XVI—Distribution of number of fires by months, 1971-1977, 
in the Greater New York area 


Number of Months with Indicated Fires in 


Manhattan 5 Boroughs Greater N.Y. 

Fires Obs Poi NB Obs Poi NB Obs Poi NB 
0 44 37.8 41.0 26 19.7 22.8 15 9.2 12.9 
1 20 30.2 26.5 23 28.5 26.8 17 20.3 20.6 
2 13 12.0 11.1 13 20.7 18.3 23 22.5 19.4 
3 7 K 3.8 17 10.0 9.5 8 16.6 13.9 
4 3 3.6 4.1 12 9.2 8.5 
5 2 1.1 1.6 4 4.1 4.6 
6 4 1.5 2.3 
7 1 0.5 1.1 
m 0.80 1.45 271. 

s2 0.98 1.79 3.06 

r 3.44 6.19 5.78 

c 31 4,27 2.61 


at probability level of 0.03, 0.04, and 0.06, respectively. There is some 
evidence that fires tend to cluster in months more than a Poisson model 
would predict; note that the number of months with zero fires or with 
a large number of fires generally exceeds expectations. In such a situa- 
tion, the two-parameter negative binomial distribution (also known as 
the Polya distribution, and often used in studies of accident-proneness) 
provides a better fit to the data. In the negative binomial, it is assumed 
that m, the mean of the Poisson distribution, is itself distributed ac- 
cording to the gamma distribution c’m’—! exp(—cm)/T(r). The proba- 
bility of 0, 1, 2, 3, --- observations in a cell is given by the successive terms 
of the series 


¢ tt rrr tt) rir + U(r + 2) 
(i) e+ 1’ M(e+1)2’ 3e+18 7") 


where c and r are estimated from the mean and variance of the data by 
the formulas 








c=m/(s?—m),r=cm. 


The negative binomial fit to the data is given in the “NB” column of 
Table XVI; the fit is considerably improved. 

As far as fire reports are concerned, March 1975 (the month following 
the Second Avenue fire) witnessed three fires in Manhattan, one in 
Queens, and one in the suburbs. (These numbers do not tally exactly with 
newspaper-reported fires for several reasons: One Manhattan building 
fire inexplicably failed to generate a fire report, and a couple of fires 
occurred on customer premises or in outside plant, which are not covered 
by the fire report; on the other hand, one fire included here was not re- 
ported to the fire department and did not appear in the papers.) Using 
the Poisson model, the estimated probability of three or more fires in 
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Manhattan in one month is 0.048; of four or more in all five boroughs, 
0.060; of five or more in the Greater New York area, 0.074. Using the 
negative binomial model, these probabilities increase to 0.064, 0.079, and 
0.102, respectively. There is some evidence that March 1975 was an 
unusually busy month for telephone building fires in Manhattan; how- 
ever, there is less evidence that it was an unusually busy month for fires 
in the five boroughs or the Greater New York area. In other words, the 
influence of the February 27 fire upon building fire statistics during 
March decreases as ever-larger geographical areas are considered—a 
hardly surprising result. 


IV. CONCLUSIONS 


The Bell System has a very good record with respect to building fires. 
About 200 fires per year were reported between 1971 and 1977 in Bell 
System operating company buildings, or on roofs or grounds; of these, 
inside fires occurred at a rate of approximately 0.6 per million square 
feet per year. (However, unreported fires may increase this figure by 30 
percent or more.) All fires but the New York fire on February 27, 1975 
appear to be well modeled by a log-normal probability density function 
of damage with a median value of $30 to $80 (or a mean value of $5,000 
to $18,000); about one percent of all fires exceeds $100,000 in damage. 
The New York fire demonstrates that there is a small, but finite, chance 
of far more damaging fires; the best estimate of the probability of fires 
not following the log-normal damage distribution is 0.0002, based on 
1960-1977 experience of 4496 fires. A tenth of all fires and half of all fire 
damage occurs in building unoccupied at the time of the fire. There is 
considerable evidence that fires occur in clusters; about 20 percent of 
all multiple-fire buildings had two or more fires that occurred near in 
time under similar circumstances. Furthermore, the enormous news- 
paper publicity of the New York fire may have been responsible for a 
modest but statistically significant increase in telephone building fires 
in the Greater New York area during the following month. 
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Boundary Integral Solutions of 
Laplace’s Equation 


By J. L. BLUE 
(Manuscript received January 26, 1978) 


Although Laplace’s equation is simple, the region over which it is 
to be solved is often complicated. Both the shape of the region and the 
boundary conditions can induce solutions ® which are singular at iso- 
lated points on the boundary of the region. 

Boundary integral equation methods are well-suited to the problem, 
reducing a two-dimensional partial differential equation to a one- 
dimensional integral equation. Unfortunately, the standard boundary 
integral equation methods lead to an ill-conditioned set of linear 
equations, restricting the achievable accuracy in the approximate so- 
lution. 

This paper describes an improved boundary integral method. A new 
integral equation 1s derived. Laplace’s equation is reduced to solving 
two coupled, one-dimensional integral equations. The resulting linear 
equations are well-conditioned. 

A program package for solving Laplace’s equation has been devel- 
oped. The package solves Laplace’s equation in two dimensions or in 
three dimensions with axial symmetry. The region may extend to in- 
finity, and may be multiply-connected. In addition to smooth basis 
functions, the program automatically includes appropriate singular 
basis functions, greatly improving the achievable accuracy for regions 
with corners. 


1. INTRODUCTION 


Laplace’s equation frequently arises in modeling physical problems, 
especially in electromagnetism, in thermal flow, and in fluid flow. In two 
dimensions, Laplace’s equation is 


O26 O2b _ 
ox? = Oy? 


> 


2797 


Fig. 1—Region used in an analysis of an electrostatic lens. 


and in three dimensions, 
02h 02h 02H 
—+——=0 
ox? dy? dz? 
To complete the specification of a particular problem, a region on which 
to solve Laplace’s equation must be specified, plus boundary conditions 
on the boundary of the region. 

As compensation for the simplicity of the partial differential equation, 
the region over which Laplace’s equation is to be solved is often com- 
plicated. Figure 1 shows the region used by the author in an unpublished 
analysis of an electrostatic lens. The solution is singular at the re-entrant 
corners. (By singular, we mean that ® has a finite limit as the corner is 
approached, but that some derivatives of © do not have a finite limit.) 
The singularity is a consequence of the region itself, not of any particular 
boundary conditions. In fact, the solution is singular unless very special 
boundary conditions are prescribed. 

Even with a rectangular region, the solution can be singular at isolated 
points. Figure 2 is an example, a thin-film capacitor with metal top and 
bottom contacts. To obtain its capacitance, Laplace’s equation must be 
solved inside the rectangle. The boundary conditions are ® = 1 on the 
top contact, ® = 0 on the bottom contact, and zero normal derivative, 
o’/on = 0, on the remainder of the boundary. (The definition of the 
normal derivative is given in the next section.) At the center edge of the 
top contact, the solution is singular. 

Standard methods for elliptic partial differential equations include 
finite difference and finite element methods. Both methods require a 
grid, usually rectangular or triangular, everywhere inside the region. 
Thus the region must be bounded. Both methods are difficult to apply 
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p= 





p=Q0 


Fig. 2—Region used in an analysis of a thin-film capacitor. The potential is singular 
at the center of the top side. 


to complicated regions; if the true solution has singularities, accuracy 
is usually poor unless heroic measures are taken. Neither method is 
suitable for a package for general regions and boundary conditions. 

Laplace’s equation is the simplest elliptic partial differential equation, 
and has been the subject of a great deal of analysis. Special methods for 
Laplace’s equation are available, methods that do not work for general 
elliptic partial differential equations. 

Special methods for Laplace’s equation include the so-called “fast 
Poisson solvers.” ! They can quickly solve V26(x,y) = f(x,y) if the region 
and boundary conditions are sufficiently simple. However, even Fig. 2 
is not simple enough because of the mixed boundary conditions on the 
top boundary. The fast Poisson solvers have great utility for special 
problems, but are not appropriate for a general Laplace package. Recent 
research (Ref. 2, for example) indicates how these methods may be ex- 
tended in the future. 


1.1 The boundary integral equation method 


The most useful special method for Laplace’s equation is the boundary 
integral equation method. The basic method has been known for many 
years,** but has enjoyed a renewed popularity since the advent of large 
digital computers. A few representative references are Refs. 5 to 9. A 
two-dimensional partial differential equation is reduced to a one-di- 
mensional integral equation. Similarly, a three-dimensional partial 
differential equation can be reduced to a two-dimensional integral 
equation. The integral equation involves only the geometry and the 
values of ® and 0@/dn on the boundary. Multiply-connected regions pose 
no added difficulty. After the integral equation has been solved ap- 
proximately, another integral can be done to evaluate V@ and © at any 
point inside the region. 

The boundary integral equation method has been quite successful, 
providing fast and inexpensive solutions for Laplace’s equation in two 
dimensions. The usual implementation does have several difficulties. 
First, the integral equation is a Fredholm integral equation of the first 
kind for 0@/on, and consequently is ill-conditioned. (For either a Diri- 
chlet or a Neumann problem, a well-conditioned integral equation is 
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Fig. 3—Illustration of the definitions of s, xs, Vs, Ms, and F’. 


available, but not for mixed boundary conditions.) Matrices generated 
for approximate solutions to the integral equation are ill-conditioned, 
and O0®/dn cannot be found accurately. Second, three-dimensional 
problems with axial symmetry are essentially two-dimensional problems, 
but no provision is made for their solution. (The next two objections do 
not apply to Ref. 9.) Third, the unknown & and 0®/0n are approximated 
by low-order polynomials on sections of the boundary. Convergence 
requires many coefficients if accuracy of more than a few percent is re- 
quired. Finally, no provision is provided for dealing with singu- 
larities. 

The present paper describes an improved boundary integral method 
which counters all the above difficulties. Two coupled integral equations 
are used; the combination leads to a well-conditioned matrix. Both ® 
and 0®/dn can be obtained accurately. Higher-order approximations 
for the unknown @ and 0®/0dn are used. Corner singularities are recog- 
nized automatically, and special approximating functions are used. 
Axisymmetric problems are solved by the same program. Typical 
problems cost only a few dollars to run. A reliable estimate of the accu- 
racy of the approximate ® is available. 


1.2 The Laplace package 


The method described in this paper has been implemented in the 
Laplace program package. A user’s guide for the Laplace package, with 
several examples, is available separately.1° The program package is 
written in EFL,!! an extended Fortran language. The output of the EFL 
compiler is portable Fortran. 

The package solves Laplace’s equation in two dimensions or in three 
dimensions with axial symmetry. Two-dimensional regions must be 
bounded by straight-line segments. Three-dimensional regions must 
be figures of revolution whose cross section in the (r,z) plane is bounded 
by straight-line segments. The region may extend to infinity, but the 
boundary must not extend to infinity. The region may be multiply- 
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connected. On each line segment, either & or 0@/On may be specified 
as a boundary condition. In addition to smooth basis functions, the 
program automatically includes the appropriate singular basis functions 
greatly improving the achievable accuracy for regions with corners. 

Section II discusses the mathematical basis for the boundary integral 
equation method. Section III describes the implementation of the 
method. Possible extensions to the program package are discussed in 
Section IV. Section V has results for a sample problem. The appendix 
derives the new integral equation used. 


ll. INTEGRAL EQUATION FORMULATION 


In this section, Laplace’s equation is formulated as a pair of coupled 
integral equations. The two-dimensional partial differential equation 
is reduced to a pair of one-dimensional integral equations. 

We wish to solve 

2 4072 5, O°@ 

V2(x,y) a + dy? 0 (Pla) 
for (x,y) in aregion D with boundary I. D may be multiply-connected, 
in which case I has several distinct parts. For now, we discuss only the 
two-dimensional “interior” problem, with D a finite region. At the 
conclusion of this section, we discuss the two-dimensional “exterior” 
problem, with D an infinite region, and the three-dimensional axisym- 
metric problem, both interior and exterior. 

As in Fig. 3, let (x;,y,) be the coordinates of the point at arc length s, 
and denote ¢(s) = ®(x;,y;). We will use & for the potential of a general 
point, and ¢ for a point on I’. Let n, be the outward-pointing unit normal 
vector at s. For a point s not at a vertex of I’, define 


W(s) = lim  n,-V&(x,y). 
(x,y)—>(xs,¥s) 


The notation 0¢/dn, is also used for the right side of the above defini- 
tion. 

For the problem to be well-posed, a boundary condition must be given 
at each point of T’.!2 The Laplace package allows the specification 


o(s) = 6;(s) on part of I, say T; (P1b) 
W(s) = bo(s) on the remainder of I’, say 12. (Plc) 
For any fixed point F = (xz,yr), the Green’s function, or fundamental 
solution to Laplace’s equation, is 
G(x,y;xp,.yr) = —Yeln [(x — xp)? + (y — yr)?I. 
Except at point F, V2G(x,y;xp,vr) = 0. 
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Now let (x,y) be any point strictly inside D. Green’s boundary identity 
is? 


onb(x,y) = f. [V(s)G (xpys32,7) — O(s)ng « VeG (g,vesx.y)]ds. 


The gradient operator, V,, operates on the x, and ys. The above equation 
is usually abbreviated as 


2rB(ny) = [ vg - 466) 2] ds, (1) 


with the arguments of G left implicit. 
If (x,v) is a point at arc length t on a smooth part of I, it may be 
shown®:!8 that 


ro= f | ve =) al ds. (2) 


The integral is now a Cauchy principal-value integral at s = t. 
Suppose that the correct ¢(s) and ¥(s) are not known, but only ap- 
proximate values #*(s) and ~*(s) are known. Then the function &* (x,y) 


defined by 





0G | as 

Ons 

exactly obeys Laplace’s equation for (x,y) strictly inside D. &* will not 
obey the correct boundary conditions as (x,y) approaches the boundary 
unless ¢*(s) and y*(s) are chosen correctly. 

Thus the boundary integral equation method is one of the class of 
“particular solution” methods.!4:!5 Any approximate solution obeys the 
partial differential equation exactly, but only obeys the boundary con- 
ditions approximately. The advantage over the usual particular solution 
methods for Laplace’s equation, as seen in Ref. 16, for example, is that 
the boundary integral particular solutions incorporate the exact 
boundary of the region and do not require a restricted region. They are 
more complicated to calculate, but are appropriate for the region. 

Equation (2) may be used to obtain an integral equation for ¢* and 


Y*. 


Qrb* (x,y) = f. | vr(sG — $*(s) 


ror= | vree ~ $*(s) a] oi. (3a) 


Ss 


Letting R be the vector from point t to point s, and R the length of R, 
(3a) may be written as 





ror() = F [orem Rr) |as@b) 
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This boundary integral equation has been used for many years for solving 
Laplace’s equation. 

For example, if ¢(s) = 6;(s) is given on all of I (Dirichlet problem), 
set *(s) = $(s), and the above is then an integral equation for the un- 
known y*(s). Thus a two-dimensional partial differential equation has 
been reduced to a one-dimensional integral equation. An approximate 
solution may be obtained by expanding y*(s) in an appropriate set of 
basis functions, and taking a finite number of these. 


N 
Y*(s) = x ajf;(s). 
= 


The f’s are piecewise constant functions in Ref. 6 and piecewise quadratic 
in Ref. 7. We discuss an appropriate set of f’s later. The integral equation 
(3a) then becomes 


N oG 
2 a; F fi(s)G ds = +. bi(s) ds + abi(t). 


If M = N points ¢; are chosen at which to make this equation hold exactly 
(collocation), a set of N linear equations for the N unknowns, aj, is ob- 
tained. If M points t;, M > N, are chosen, an over-determined set of 
linear equations is obtained for the a;’s. This reduces the sensitivity of 
the approximate solution to the exact choice of the t;. The equations 
are 


N 
> Aja; =ri,J = 1,2,...,M, 
r=) 

where 


Ai = +. fi(s)G(s,t;)ds 


n= bis) ONY edict): 
r ons 
These may be solved in a least-squares sense, say, by a standard sub- 
routine.!? 
In addition to the obvious advantages of this formulation, there is a 
well-known disadvantage. The integral equation for y*(s) isa Fredholm 
integral equation of the first kind,!® of the type 


1 
ule)= { Hix.yv)dy, 

where u and H are known and » is to be determined. This kind of integral 

equation is ill-conditioned (sometimes called ill-posed); it is difficult 


to obtain accurate solutions for v.1%:29 The reason for the difficulty is easy 
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to see. Since v appears only inside the integral, its high-frequency com- 
ponents are not well-determined; by the Riemann-Lebesgue lemma, 


1 
lim ; H (x,y) sin (ny)dy = 0, 
eae 
if H is not too badly behaved. The difficulty numerically is that the 
matrix {A jz} is ill-conditioned. Small errors in calculating elements Aj, 
or r; lead to much-magnified errors in the coefficients a;. If asequence 
of approximate solutions with increasing N is done, the larger matrices 
are increasingly ill-conditioned. The typical failure mode is that ¥*(s) 
does not converge as N increases, after a certain point; rather, spurious 
and unphysical oscillations in ¥*(s) are seen. 

Various methods of ameliorating the difficulty have been suggested, 
such as regularization”! and matrix singular-value decomposition. !9,20.22 
Better yet is to derive a Fredholm integral equation of the second 
kind. 

In the appendix, we derive the following identity for t any point ata 
smooth part of I. 


: BG she ae Oe 
rule) =F {¥4s) on 7 Ws) =o] ea ds. (da) 





For the Dirichlet problem, this identity leads to a Fredholm integral 
equation of the second kind for y*(s) and is not ill-conditioned. Ap- 
parently, but surprisingly, (4a) is new. The integral equation derived 
from (4a) may be written as 


ry*(t) = = {Lor(s) — $*(t)] 


” 2(n,;- R)(n;- R) — R2n, - n; 


7 +E yrs) ds. (4b) 


With two integral equations, one well-conditioned for ¢* and the other 
for ¥*, problem (P1) can be reduced to a set of linear equations with a 
well-conditioned matrix. If fitting point t; is on T'1, where ¢(s) is speci- 
fied, use (4). If ¢; is on I'2, where y(s) is specified, use (3). A coupled pair 
of linear integral equations results. Analogously to the Dirichlet problem 
discussed earlier, appropriate basis functions and fitting points can be 
chosen, and the problem reduced to a set of linear equations. Some of 
the complications will be covered in later sections of the paper. 

For the Dirichlet problem, ¢ given everywhere on I, (4) cannot be used 
everywhere. Since (4) is independent of the zero of potential, (4) alone 
will lead to a singular matrix, of rank N — 1. Special methods may be 
used for dealing with rank-deficient matrices, or the other equation, (3), 
may be used at some fitting points. 
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2.1 Exterior two-dimensional problems 


We now consider solving Laplace’s equation in an infinite region, D, 
exterior to a finite boundary, I’. To have a unique solution, it is insuffi- 
cient to specify either 4(s) or ¥(s) at each point of I’. In addition, the 
behavior of &(x,y) far from TI must be specified. Use standard polar 
coordinates, (r,0), with r= Vx? + y? and suppose 


fa DO) So eee OR: 
ee 2r r 
where ®., and W., are constants. If W.. is specified, then a unique solution 
can be found.!? W., is the negative of total flux extending to infinity. 

The earlier equations apply with small changes. For example, (3a) 
must be replaced by 


‘ 0G 
ao*(t) = 2r@., + mF | o*(s) i ye | ds, 
r on, 
and the unknown ©®., must also be found. 
The user specifies Y.. as well as boundary conditions. The Laplace 
package calculates an approximate value for ®., as well as for ¢* and y* 
on I. 


2.2 Three-dimensional axisymmetric problems 


Most of the preceding two-dimensional analysis needs only minor 
changes for the three-dimensional axisymmetric problem. Unlike the 
two-dimensional problem, the same formulation is adequate for interior 
and exterior three-dimensional problems. We use standard cylindrical 
coordinates (r,6,z). We wish to solve 

Vision 2 (, gts ee 
ror or r2 002 Oz? 
in an axisymmetric region D; D is formed as a figure of revolution by 
rotating a region D, with boundary I, about the z-axis. The boundary 
conditions must also be independent of 6. 
For any fixed point F = (xp,yr,z7), the Green’s function is 
1 


1 
Gey Ee) = 
(2.y,250 RY K28) [(x —xp)?2 + (y — yr)? + (2 —zp)?]1/2) Rs 


The integral equation corresponding to (38) is!8 


np (t) =F | ve =H) | dQ. 


Ss 


The integral is an area integral on the boundary, the surface of revolu- 
tion. 
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Without loss of generality, let t be at 9 = 0. Let I be the intersection 
of Q with any plane 6 = constant. Express the area integral as an iterated 
integral over @ and s, arc length along I’. Since ¢* and y* are independent 
of 6, all the @ dependence is in G and 0G/dn,. R3 may be expressed as 


R2 = R? — 2r,r,(1 + cos 6), 
where 
R? ae (rT + r)? a (zs _ Z4)?. 
With much manipulation, the @ integration may be performed, giving 
2no(t) = fF | 4r.K(m)ya(s) 
r 
2 | 4r,n,+ Ro 
R?,(1 — m) 


where e; and e, are unit vectors in the 6 = 0 plane, and Rg is the vector 
in the 6 = 0 plane from t tos. 


One Daas R= B(m)} |o*(s)] 2, (5) 


m 


Ry = (r, = ry)e; oc ee = 21)ez 
A4r.r; 
Ree 


K and E are the usual complete elliptic integrals.2° 


m= 


a/2 
E(m) = f, (1 — m sin? w)!/2du 


x/2 
K(m) = f, (1 — m sin? u)71/2du. 


The integral equation corresponding to (4) is considerably more com- 
plicated. 








2ry*(t) = wa {| be Re Z  2remy +e, ——— |y*65) 
rR, 1—m m 


+ | (a, *@r)(M; + er)(QH — K) — ns + ne 


4s Belen -Bp) (22— ml 


RG= wi Se) E~K) 


+ a [(n, + e-)(ny - Ro)r; — (n; + e-)(ns - Re)rs] 








E K-E 
x ( ea 


) | vo - ool] a. © 


K and E have been used as abbreviations for K(m) and E(m). 


1—m 
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2.3 Error estimates 


If (x,y) is strictly inside D, the approximate potential (in two di- 
mensions) obeys 


Qrb* (x,y) = f. | vr(s)G — $*(s) | ds. 
If (x,y) is on a smooth part of I, 
* = oe * 0G 
rO*(ny) =f | vree o*(s) [as 


If (x,y) is at a vertex of I’, the z is replaced by the interior angle of the 
vertex. Similarly, Vé* inside D and 06*/dn on TI may be obtained by 
the analog of (4). 

The function ®* as defined above exactly obeys Laplace’s equation 
inside D. Therefore, by the maximum principle,?4 the maximum error 
in &* occurs somewhere on I. 


b*(x,y) - H(x.9)| < max| #*(x,,y,) — o(s)1. 





For the Dirichlet problem, a rigorous error bound is in principle possible 
by finding the largest discrepancy between ®* and the boundary con- 
dition ¢. However, finding the error bound can be more expensive than 
solving the integral equation. 

For mixed boundary conditions, a rigorous bound is in general im- 
possible. The above bound is still correct, but is not useful, since the true 
® is not known on all of the boundary. For certain restricted regions, 
another rigorous error bound can be obtained.?° For these restricted 
regions, 


b* (x,y) — %(x.)| < max|S*(x,,y,) — 4(s) 
Ty 








— W(s)], 


+ Rp max 


o’* (x5,Vs) 
To fo) 


Ng 





where Fp is the maximum perpendicular distance from any point of Ig 
to any other point of I’. This bound is in principle possible to compute, 
but is expensive in practice. 

An error estimate is available at no extra cost in the Laplace package, 
because of the method of solution. The over-determined system of linear 
equations is solved in a least-squares sense, minimizing the total fitting 
error (TFE), 


& s (Ajja; — ne] 
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and returning this error. For the implementation discussed in the next 
section, with M ~ 3N/2, numerical experiments indicate that the TFE 
is areliable upper bound on the error in the potential on the boundary, 
and a substantial overestimate of the error away from the boundary. 


ill. IMPLEMENTATION 


Section II was general and discussed mathematics; we now become 
more specific and discuss numerical analysis. We also discuss some of 
the myriad details necessary to make a computer program feasible. 


3.1 Geometry 


Section II considered regions of arbitrary shape. The current imple- 
mentation of the Laplace package requires that I be composed of finite 
straight-line segments. This is in contrast to the usual practice in anal- 
ysis, of requiring that I! be asmooth curve everywhere. Many practical 
problems have corners in their geometries, so it is essential to be able to 
handle such boundaries. It is easier to analyze exact corners than 
“smooth” geometries with a very small radius of curvature rather than 
a corner. In the following, each of the straight-line segments is called a 
side. 


3.2 Basis functions 


In the previous section, the choice of the basis functions f; (s) was left 
arbitrary. However, the particular choice made strongly affects the ac- 
curacy and efficiency of the program. At least four factors should be 
considered. 

(t) The basis functions should be able to model the behavior of ¢(s) 
and y(s) with only a few functions. 
(iz) If enough basis functions are used, they should be able to ap- 
proximate ¢(s) and y(s) arbitrarily well. 

(it) The basis functions should be a well-conditioned set, so that small 
errors in doing the integrals do not lead to large errors in the 
approximate solution. 

(tv) The integrals of the basis functions times G, 0G/Odn,, and 
07G/d0n,0n; must be tractable, either analytically or numeri- 
cally. 

Historically, (tv) has been dominant. Symm® approximated curved 
boundaries by straight-line segments and used piecewise constant basis 
functions. Hayes’ allowed boundaries to be straight-line segments or 
arcs of circles, and used piecewise quadratic basis functions. Blue? al- 
lowed straight-line boundaries and allowed piecewise polynomial basis 
functions. For all these choices, the integrals in (3) and (4) can be done 
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Fig. 4—Local polar coordinates used for eq. (7). 


analytically. The above authors did not treat the axisymmetric problem, 
but the integrals in (5) and (6) are intractable analytically. 

If the boundary has corners, ¥(s) can be infinite at the corners, and 
piecewise polynomial basis functions will not be able to approximate y(s) 
well. For example, in Fig. 4, suppose the boundary conditions are as 
shown, and the interior angle is y,,. Choose polar coordinates (p,y) 
centered at the vertex, with angle y = 0 on the ¢ = 0 side; let s = sp at the 
corner. Then for small p, ® has an expansion (in two dimensions)?® 


foo} 


P(p,y) = Crp™ sin (any), (7) 


where 
2 (n — "y)a 
Ym : 


n 


Thus © is identically zero on the line y = 0. On the line y = ym, the 
normal derivative is identically zero. 
On the line y = 0, where sg 2 s, the normal derivative is 


¥(s) = Lo anCy(s — so)er7?. 
n=1 
For the case y,, > 2/2, we have a; < 1, and we expect ¥(s) to be infinite 
at the corner, unless C; happens to be zero. For ¥(s) to be approximated 
accurately with only a few basis functions, one of them should be a;(s 
— $9)%1—! with the correct a. 
Similarly, on the line y = ym, where s < so, the potential is 


(8) = YL Culso— s)e(—1) 


Therefore a basis function (sp — s)*1 is needed on the y = y,, side. In fact, 
only a single unknown coefficient, C,, need be introduced to deal with 
the worst part of ¢(s) and ¥(s) at so. It may also be desirable to include 
a few of the less singular basis functions. The Laplace package includes 
singular basis functions for which a < dQmax, With a default value amax 
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= 1, and does not include any singular function whose a is within 0.1 of 
an integer. 

Similar singular basis functions are used at corners where ¢ is specified 
on both sides and where y is specified on both sides. Since the expansion 
(7) is not necessarily convergent far from the vertex, the singular basis 
functions centered at a corner are used only on the two sides meeting at 
that corner. 

For axisymmetric problems, the expansion (7) does not hold, but the 
exponent of the singularity is the same for off-axis points, since the 
singularity depends only on the highest order derivatives in the differ- 
ential equation.?° For singularities on the axis, the exponents are dif- 
ferent,2’ and singular functions have not yet been implemented. 

Additional “smooth” basis functions are also needed. A well-condi- 
tioned family of basis functions is B-splines. A brief description of some 
of their properties follows.2829 B-splines are defined on a line divided 
into intervals by knots. B-splines of order k are piecewise polynomials 
of degree k — 1. Each B-spline is nonnegative, has exactly one maximum, 
and has local.support. The sum of B-splines at any point is identically 
one. In any interval, exactly k B-splines are nonzero; each B-spline is 
nonzero in at most k intervals. The B-splines on a line are uniquely de- 
termined by the knots, which may be multiple. At a knot with multi- 
plicity m, a kth-order B-spline has k — m — 1 continuous derivatives. 
If m = k — 1, the B-spline is only continuous; if m = 1, the B-spline has 
k — 2 continuous derivatives. 

The user.chooses a k, the same for all sides, and the number of interior 
knots on each side. Mesh spacing proceeds according to the following 
rules. A vertex is called singular if its expansion, as in (7), has a; < 0.9. 
If a singular basis function is used, the vertex is called compensated. If 
the two vertices delimiting a side are each either compensated or 
nonsingular, the interior knots on the side are spaced uniformly. Oth- 
erwise, the interior knots are spaced closer together near uncompensated 
singular vertices. 

For a given mesh, higher-order B-splines are potentially more accu- 
rate, since the approximation error can be O(h*) [30], where h is the 
maximum mesh. However, the integrals for higher order splines are more 
difficult, and there are more unknown spline coefficients for higher k. 
Currently, the Laplace package restricts k to be 2, 3, or 4. 


3.3 Boundary conditions 


If o(s) or ¥(s) is specified as an arbitrary function, the integrals in- 
volving the boundary conditions require special methods. Instead, the 
Laplace package does a least-squares fit of the boundary condition to 
a B-spline. A separate fit is done on each side; the same order and mesh 
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are used as specified by the user for that side for the unknown y*(s) or 
p*(s). 

For the Neumann problem—y specified on all of [—the problem is 
undetermined up to an additive constant in ®. A solution exists for the 
interior problem only if fy ¥(s) ds is exactly zero. The Laplace package 
currently will not solve the Neumann problem; ¢ must be specified on 
at least one side. 


3.4 Integrals 


With polynomial basis functions and boundaries composed of 
straight-line segments, the integrals in (8a) and (4a) can be done ana- 
lytically. This can cause conditioning problems; B-splines are a well- 
conditioned basis only if calculated properly.”9 The integrals in (5) and 
(6) cannot be done analytically, even with polynomial basis functions. 
With singular basis functions like (s — sg)*, none of the integrals can be 
done analytically unless a is special. 

All the necessary integrals can be done accurately and efficiently by 
the numerical methods described in this section. We first consider (8a) 
and (4a), for any fixed t. For straight-line boundaries, the integral over 
I is divided up into a sum of integrals over the line segments. We con- 
sider only a single segment, and eliminate any subscript referring to the 
segment. On the segment, n, is constant, and R may be written as 


R= Ring at (s = $1 )es, 
where e, is a unit vector along the side. Also expand n; as 
ny =n ny, + nes. 


The portions of (3a) and (4a) from the segments are 


* = o Ry - 
eO- Slee ree 


— "In [RZ + (s — 5 )°W*(s) |ds (3c) 


my*(t) =F {[o*s) — o*(t)] 
x 2R i [n Ry tny(s—s.)|—- [Ri + (s—s1)7%In1 
[Ri + (s>3)7/? 


n,R, +nj(s—s,) Z 
eer ance, (s)} ds. (4c) 


We first consider the case where the point t is not on the segment in 
question. Then the Green’s function parts of the integrals are not sin- 
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gular; either R ,; ~ Oors, is notin the interval [s,s2]. As an example, 
look at the integral with the logarithm in it, and look at one term of the 
expansion of y*(s), with the basis function B;(s). Further divide the 
segment into subintervals, between knots of the spline, so that over each 
subinterval B;(s) is a polynomial. Over each subinterval, the integrand 
is a polynomial times a nonsingular function. Gauss-Legendre quadra- 
ture is ideal for such integrands, if the order of the quadrature rule can 
be determined a priori. (An automatic quadrature method could be used, 
such as Refs. 31 or 32, but these are usually less efficient.) Computing 
the order of the quadrature rule necessary can be done using the results 
of numerical experimentation. If s;, and s;, are the ends of the sub- 
interval, let s. = (s;, + s;,)/2 and h = s;, — s;,. The change of variable 
u= 2(s — s,)/h changes the term in question to 


h 1 
= ff. [In (h2/4) + In [a2 + (u — b)?]]B;(s- + hu/2)du, 
4 J-1 


where a = 2R , /h and b = 2(s , —s,)/h. Since the integral is over a single 
mesh interval of B;, we expect the error to be no worse than the worst 
error in any of the kth-order B-splines with k-fold knots at —1 and 1, and 
no interior knots, since the latter B-splines vary more rapidly over the 
interval. Thus we look only at the errors in these k B-splines; call them 
B(u) to distinguish them. 

Now consider the family of integrals 


1 ae, 
1, (a,b,k) = A. In [a2 + (u — b)2]B;(u)du. 


Let E;(a,b,k,n) be the error in evaluating J;(a,b,k) by an n point 
Gauss-Legendre quadrature rule, and let 


k 1/2 
E(a,b,k,n) = | >» Bj(a,bskan)?| 
j=l 


Numerical experiments show that in the (a,b) plane, the locus of constant 
E(a,b,k,n) is approximately an ellipse. For given n and desired accuracy, 
¢, there is an ellipse with semi-axes A(k,n,¢) and B(k,n,«) so that the error 
is satisfactory if a and 6 are outside the ellipse, or 


La@mal *Laamal 4 


For doing the integrals J;(a,b,k) to accuracy e, the functions A(R,n,«) 
and B(k,n,¢) are determined experimentally for a series of values of n. 
(The default values are n = 4, 6, 8, 10, 12, and 16, and « = 10~®.) For any 
particular a and b, the smallest satisfactory n is used. If the largest n 
available is insufficient, then the interval is divided; this is seldom 
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necessary. In the Laplace package, A(4,n,«) and B(4,n,e) are used for 
k <4. 

The other integrals in (3a) and (4a) are done similarly, using numer- 
ically derived ellipses for B-spline basis functions. For singular basis 
functions, Gauss-Jacobi quadrature formulas are used; these are Gauss 
quadrature formulas on (0,1) with weight function x°—!. Different 
quadrature formulas are used for each of the unique a’s used in singular 
basis functions. The same ellipses as calculated for B-splines are used; 
slightly smaller ellipses could be used, but the gain in efficiency is 
small. 

The Gauss quadrature formulas are calculated portably by the method 
of Sack and Donovan,°3 using programs in the PORT library.*4 

The integrals of (5) and (6) are somewhat more complicated than those 
of (3a) and (4a), but are no harder numerically. The same ellipses are 
used. 

If point t is on the line segment, then the Green’s functions in the in- 
tegrals have singularities. The integrals (8c) and (4c) simplify somewhat, 
since n, = n;, nj = 0,n, = 1,andR, =0. The ¢* term in (8c) is iden- 
tically zero; the y* term is 


=i), i "In [(s — t)2]¥*(s) ds. 


The y* term in (4c) is identically zero; the ¢* term is 


a ae ds 
- fF" ere) - Ol SS. 





Special care must be taken to get accurate approximations to these 
singular integrals. 

First consider a B-spline basis function, B;(x), again dividing (s,s) 
into subintervals. If t is not in the subinterval in question, then the 
previous methods are adequate. (‘The Laplace package never takes t to 
be exactly at a knot.) For the logarithmic integral, the subinterval in- 
cluding ¢ is, for some positive 6; and 6g, 


at, f o In [(s — £)2]B;,(s) ds 
61 62 
= — f, In (u)B;(t —u)du — 7. In (u)B;(t + u)du 
0 0 
=— 6; f In (v)B;(t — vd,)dv — In (6,) f- B;(t — u)du 
= f in BG ped Oo) f Beto din 
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The integrals with In (v) are done by Gauss quadrature with weight 
function In (v); the others are done by Gauss-Legendre quadrature. The 
Gauss quadrature formulas with logarithmic weight function are also 
calculated portably by the method of Ref. 33. 

The Cauchy principal-value integral, for the subinterval including 
t, is 


— ct Bis) — Bit) a 
f, (s = t)? 
meee ss t+o1 ds t*82 Bi(s) — Bi(t) — (s — t)B’i(t) 
ee t-5, S—t I (s — t)? as; 


The first integral is done analytically. The second has no singularity at 
s = t and is done analytically as 


tt+doT 1 

— f, | Br" +—(s —t)B/"(t) +.. as. 
t—61 2 6 

This is adequate for low-order B-splines. For high-order B-splines, more 

care would be necessary. 

Now consider integrals with singular basis functions and with ¢ on the 
same segment as s. If ¥(s) is given on the segment, ¢*(s) may have (s. — 
$1)“ or (sg — s)* terms; however, if ¥(s) is given, (3) is always used for t 
on the side, and the ¢* terms vanish because R , = 0. If ¢(s) is given on 
the segment, ¥*(s) may have (s — s1)*~! or (sz — s)*—! terms; however, 
if 6(s) is given, (4) is almost always used, and the ¥* terms vanish because 
R | =0. The exception, when ¢(s) is given on a segment and (3) is used, 
occurs only for the Dirichlet problem, ¢ given on all of P. Then (3) is used 
at the central fitting point of each side, and we need integrals of the 
form 


Lip g ff. ”" tn [(s — t)2](s — 81)¢-1ds. 


1 


As much as possible of the integral 
t 
§ In (€ — s)(s — s1)*7!ds, 
$1 


starting from sj, is done using Gauss-Jacobi quadratures. The remainder 
has only a logarithmic singularity. It and the integral form t to sz are done 
by Gauss quadrature with a logarithmic weight function, as described 
earlier in this section. 


3.5 Complete elliptic integrals 


The complete elliptic integrals K(m) and E(m) are necessary for the 
axisymmetric problem. Suitable expansions are®® 
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K(m) = Px(1 — m) — Q,(1 — m) In (1 — m) 
E(m) = Pe(1 — m) — Qc (1 — m) In (1 —™m). 


Polynomial approximations for the Ps and Qs are given in Ref. 35. The 
argument (1 — m) is used instead of m to avoid excessive error as m —> 
1, 1.e., as s — ¢ in (5) and (6). 

The combination D(m) = [K(m) — E(m)]/m is also needed. As m > 
0, D(m) — 2/4. For D(m), another approximation of the above type was 
generated. 


3.6 Fitting points 


At least N fitting points are needed to determine the N unknown 
coefficients of the basis functions. The work to calculate the matrix is 
proportional to the number of points used. If more than N points are 
used, the sensitivity of the solution to the placement of the points is di- 
minished, as is the amplification of any small errors in calculating matrix 
elements. In the Laplace package, approximately f times N fitting points 
are used; the default value of f is 1.5. In the subinterval between each 
pair of knots, the number of fitting points is f times the number of un- 
knowns associated with the subinterval, rounded up. The fitting points 
are uniformly spaced within each subinterval. 


3.7 Scaling, constraints, and matrix solution 


Each row of the matrix corresponds to applying either (3) or (4) at one 
fitting point, t;. To keep the solution approximately independent of the 
scaling of the region, each row corresponding to (4) is multiplied by the 
length of the side containing ¢;. 

For an interior problem, fy*(s) ds = 0. For an exterior two-dimen- 
sional problem, fy*(s) ds = WY... Either restriction may be written as 
a linear equality constraint on the unknown coefficients. When the 
matrix equations are solved by QR factorization, such linear constraints 
can easily be enforced using a method described by Lawson and Han- 
son. 36 


3.8 Portability 


A portable stack allocation mechanism“ is used for all temporary 
storage. The program is written in EFL.!! The output of the EFL compiler 
is portable Fortran. 

Two parts of the program are not portable. For « # 10-6, new ellipses 
are necessary. The approximations to the complete elliptic integrals are 
accurate to about 1078. 
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IV. POSSIBLE EXTENSIONS 


In this section, we discuss several extensions to the Laplace package 
which could be implemented if there were sufficient incentive, and the 
difficulties involved with each. Combinations of the individual extensions 
pose further difficulties, but will not be discussed. 


4.1 Higher-order B-splines and higher-accuracy integrals 


B-splines of order higher than 4 are useful if very high accuracy so- 
lutions are desired. The only change necessary to allow higher-order 
B-splines or higher-accuracy integrals is to change the ellipse semi-axes. 
This feature was not included in the Laplace package, since calculating 
the ellipses portably for any specified accuracy and B-spline order re- 
quires too much code. Alternate methods for doing integrals to any 
specified accuracy are under consideration. 


4.2 Singular basis functions for axisymmetric problems 


At a vertex away from the axis of revolution, an expansion similar to 
(7) will hold; the exponents {a,,} are the same as for the two-dimensional 
problem with the same shape as the cross section of the figure of revo- 
lution. At a vertex on the axis of revolution, the exponents are different; 
on-axis singular functions have not been implemented. 


4.3 General linear boundary conditions 


In some applications, it is desirable to solve Laplace’s equation with 
the general linear boundary conditions on I 


a(s)p(s) + b(s)¥(s) = c(s), 


with a and b simultaneously nonzero. Then (3a), say, would become 


ro*(t) = $ Vigra rahe ell o*(s) + In () a ds. 


The difficulty here is in choosing a method for accurately evaluating the 
integrals involving a/b and c/b, unless a/b and c/b are restricted dras- 
tically, say, to being constants on each of the boundary line segments. 


4.4 Curved boundaries 


Many applications have part or all of the boundary as asmooth curve, 
which the user might not wish to approximate by straight-line segments. 
In principle, all that is needed to allow I to be any smooth curve is a 
parameterization of x,, y;, and n, as a function of s. Again, the difficulty 
is in doing the integrals accurately and efficiently. The ellipse method 
would not be directly applicable. In addition, some of the integrals which 
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Fig. 5—A region which could be handled more easily by breaking it into two regions with 
an interface. 


vanished identically for straight-line boundaries would not vanish. For 
example, the ¢* term of (3b) for s and t on the same segment vanishes 
if the segment is a straight line, because n,-R is identically zero. If the 
line segment is curved, n,- R/R2 in general has a finite limit as st, and 
this term needs to be kept. 


4.5 Interfaces 


In other applications, there may be interfaces. A common problem 
is V?@, = 0 in region D, with boundary I), V2. = 0 in region D» with 
boundary Is, and interface conditions on the common portions of I, and 
I',. Typical interface conditions are ¢; = ¢2 and ky = Kee, where x; and 
Ko are given constants. 

Implementing this extension would require a significant change in data 
structure, but otherwise would be easy. No new types of integrals would 
arise. The singular basis functions at corners which are also points on 
the common boundary depend on x, and kg as well as the angles.*” 

This extension would also be useful for some single-region problems. 
A typical example is Laplace’s equation inside a U-shaped region, Fig. 
5. This could be broken artificially into two regions as shown, with in- 
terface conditions ¢; = ¢2 and y = wo. The full region requires ap- 
proximately 3 N? integrals, if N is the number of unknowns. The two 
half-size regions would each have N/2 unknowns, plus a few extra for 
the boundary values on the dotted line. Each region would require 
somewhat more than 3(NV/2)2 integrals, so that the total work would be 
somewhat more than half. 
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?=1 y=0 


Fig. 6—Geometry for a sample problem. 


V. AN EXAMPLE 


Figure 6 gives the geometry and the boundary conditions for a sample 
problem. The boundary conditions are y = 0 on the light sides, ¢ = 1 on 
the bottom dark side, and ¢ = 0 on the top (L-shaped) dark side. This 
problem was solved approximately with third-order and fourth-order 
B-splines, and with various numbers of interior knots. Each side had the 
same number of interior knots. Some of the information is summarized 
in Table I. N is the number of basis functions. The running time is given 
in seconds for a Honeywell 6070 computer. TFE is the total fitting error, 
as defined in Section II. fds is over the side from 1 to 2. The next col- 
umn gives ¢ at vertex 3. The approximate ® at (4,14) is the final column; 
vertex 1 is at (0,0) and vertex 3 is at (2,2). The time goes approximately 
as N?; most of the work is in calculating elements of the matrix. Solving 
the matrix takes time proportional to N°, but the proportionality con- 
stant is smaller than that of the N2 term. Figure 7 is a log-log plot of TFE 
against N; TFE seems to be converging as N-? for third-order splines and 
as N~4 for fourth-order splines. These rates of convergence are the op- 
timum rates for approximating smooth functions by B-splines,®° it is of 
interest to see them apparently applying for nonsmooth functions. The 


Table | 
kord nknots N time TFE Svds ~ at3 (4,14) 
15 1.19 0.1067 0.997242 0.5482 0.6189 
21 1.98 0.0255 1.000287 0.5069 0.6196 


27 3.21 0.0231 0.999932 0.5011 0.6193 
33 4.91 0.0123 0.999951 0.4999 0.6192 
7.09 0.0080 0.999965 0.4997 0.6192 


21 1.85 0.0235 0.999838 0.5041 0.6192 
27 2.89 0.0135 0.999958 0.4997 0.6194 
33 4.58 0.0066 0.999956 0.4987 0.6193 
39 6.52 0.0034 0.999961 0.4995 0.6192 
45 9.29 0.0024 0.999968 0.4995 0.6193 


BPP PR COO OOOO 
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TOTAL FITTING ERROR 
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Fig. 7—Total fitting error (TFE) vs the number of basis functions, for third-order and 
fourth-order splines. Lines proportional to N~3 and to N~4 are shown for comparison. 


last three columns appear to have converged to the accuracy allowed by 
the finite precision of the calculations. Matrix elements are calculated 
to a relative precision of about 10-6, and the matrix has a condition 
number on the order of a few hundred, so accuracy of a few parts in 104 
is all that can be expected for boundary values. fy(s) ds can be more 
accurate, since the integration can average out the boundary errors. 

For these examples, the same number of knots was used on each side. 
Other examples may require differing numbers of knots on different 
sides. The intuition of the user is valuable in deciding on the number of 
knots per side. 


APPENDIX 


Derivation of Integral Equation (4) 


Equation (4) can be derived in various ways. We use a derivation 
modeled on the derivation of (3) as sketched in Ref. 13. Start with 
Green’s identity in two dimensions. 


f ff, (uV2v — vyV2u)dA = ff (u soe / ds. 
D r Ons ons 
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Fig. 8—Integral equation (4). 


This identity is usually stated to hold for u and v which are C2 inside D 
and C!in D + T, but is more generally true. For example, the condition 
on u can be weakened to include u’s which have corner singularities as 
discussed in the body of the paper. The region D need not be simply- 
connected. 

Pick any fixed point (x;,y;) at arc length ¢t on I’, at a smooth part of 
I. Let n; be the outward-pointing normal at t. Choose 


u(x,y) = &(x,y) ~ P(x2,V2) 
v(x,y) = ner VG(x,¥5x2,¥e), 


and apply Green’s identity to the region D’, which is D minus a sector 
of a circle, with radius «, centered at ¢ (Fig. 8). Let I” be the circle sector. 
In D’, V2u = 0 and V2» = 0, so the area integral is zero. 

Consider the I” integral, and use polar coordinates (7,0) centered at 
t. Let e, be the unit vector at (r,0) pointing away from the point t. On 
I’,n, = — e;, 


y= —nz- e,/¢, 
Ov/On, = —Nn;z + e-/e€?. 
Expand ®(x,y) about (x;,yz). For (x,y) on I’, 
B(x,y) = B(x,yt) + ce,» VB(x1,¥2) + OC), 


oP Oo® 
ee (x,y) SS Se Spe V&(x2,¥e) + O(e), 
on or 


where V ®(x;,y;) is an abbreviation for V®(x,y)|.,,y,. The integrals over 
I’ may be evaluated explicitly in the limit as e—0. 
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lim u wom ds=lim {* lee, - V(x2,¥2) + O(e2)][—n; + e-/€2] edd 
0 JT Ons «0 JO 


= — lim ‘ le, - V@(xz,yz) + O(e)] [me - e-]d0 


e>0 
vis us 
=—-—n,- V0(x:,5:) = —— wt). 
| 9 t (x2,¥2) 9 v(t) 
In performing the integral, we used the identity 

" T 
f, (a+e,)(b+e,) dd= a+b, 

0 2 


true for constant vectors a and b. The other integral is evaluated simi- 
larly. 


lim (- v =) ds 
0 Jr Ons 


=lim (" |] [-e, - V(x;,y:) + Ole ]edd 
0 € 


e—0 
= — 5 v(t). 


Thus the integral over I’ gives —zy(t), in the limit «0. Again in the 
limit «0, the integral over the remainder of I becomes a Cauchy 
principal-value integral, and (4) is obtained. The argument depends only 
on the most singular terms in the Green’s function, and so is easily 
generalized to the axisymmetric case. 
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The degradation and failure of low-noise GaAs FETs have been ac- 
celerated by various stress-aging techniques including storage at ele- 
vated temperatures with and without bias, exposure to humid atmo- 
spheres with and without bias, and temperature cycling. Several 
time-temperature-bias-induced catastrophic failure mechanisms have 
been observed, all involving the Al gate metallization. These mecha- 
nisms are Au-Al phase formation, Al electromigration, and electrolytic 
corrosion. Each of these processes results ultimately in an open gate. 
Accelerated aging also produces gradual, long-term degradation in both 
de and RF characteristics, though the two are not always correlated. 
In fact, contrary to some expectations, contact resistance may increase 
almost two orders of magnitude without significant degradation in the 
noise figure or gain of a low-noise transistor. Besides contact resistance, 
other mechanisms such as traps in the channel are thought to play a 
role in the degradation of RF properties. It was found that all the im- 
portant degradation mechanisms are bias-sensitive and that aging 
without bias gives erroneously long lifetime projections. 

The cumulative failure distributions for the mechanisms observed 
approximate a log-normal relation with standard deviations between 
0.6 and 1.4. The relevant degradation or failure processes have acti- 
vation energies near 1.0 eV, which give rise to projected median life- 
times at 60°C (channel temperature) over 107 hours and corresponding 
failure rates (excepting infant mortality) under 40 FITs (40 per 10° 
device-hours) at 20 years of service. 


Il. INTRODUCTION 


This paper describes the goals, experimental methods, and results of 
a study of the reliability of low-noise gallium arsenide field-effect 
transistors! involving about 1500 devices and 1.5 million device-hours 
of aging. The ultimate purpose of this work is twofold: (7) to calculate 
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the probable failure rate as a function of time; (i) to identify the failure 
and degradation mechanisms and propose corrective action where pos- 
sible. To estimate the failure rate of the device for any given operational 
conditions, each of these mechanisms should be characterized in terms 
of the nature of its cumulative failure distribution, the median life and 
standard deviation of the distribution, and the activation energy of the 
mechanism. 

Since the reliability of a GaAs FET depends intimately on its structural 
details, especially the choice of metallization, the structure of the devices 
studied is described in Section II of this report. The various acceleration 
methods, measurement techniques, and other aspects of the experi- 
mental program will be discussed in Section III. The failure modes ob- 
served may be categorized as either sudden or gradual. The former are 
marked by a complete collapse of dc and RF properties and are almost 
always associated with a failure of the gate metallization. They are the 
subject of Section IV. (Burn-out due to undesirable voltage pulses is 
considered a matter of handling technique or circuit design and is not 
investigated in the present work.) The gradual failures involve degra- 
dation of the important RF properties, especially the noise figure and 
the gain. There is an associated, though not well correlated, change in 
the observable de characteristics. The gradual degradation of low-noise 
GaAs FETs is discussed in Section V. In Section VI, the pertinent failure 
statistics are summarized and some cumulative failure distributions are 
shown. Finally, estimated failure rates under typical operational con- 
ditions are presented in Section VII, together with some prognoses with 
regard to other operating environments. 


ll. THE STRUCTURE 


Two slightly different versions of low-noise GaAs FETs were studied, 
differing primarily in the details of the gate bonding pad. In the earlier 
form, shown in Fig. 1, the Al gate metallization extends under the entire 
bonding area which is covered by a titanium-platinum-gold final me- 
tallization.! In the later version, the bonding area is separated laterally 
from the Al to which it is connected by the Ti-Pt-Au final metal. This 
is shown in Fig. 2. In both cases, the gate bonding pads, as well as the 
source and drain bonding pads, lie on the semi-insulating substrate. A 
schematic cross-sectional view of the source and drain contacts is given 
in Fig. 3. The ohmic contact consists of a layer of 88-percent Au/12- 
percent Ge, topped successively by a layer each of silver and gold and 
then alloyed. A final metallization of Ti-Pt-Au, as described above, is 
applied on top of the alloyed ohmic contact. 

The active n-type layer is 3000 to 6000 A thick with a donor density 
of approximately 1 X 1017 em-3. An n+ layer, about 3000 A thick and 
with a donor density around 2 X 10'8 cm~? underlies the source and drain 
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Fig. 1—Plan view of early low-noise GaAs FET. 
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Fig. 2—Plan view of later model low-noise GaAs FET. 
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ule 3—Cross section of GaAs FET contact metallurgy (relative thicknesses not to 
scale). 


contacts. A buffer layer with a donor density < 10!°5 cm~3 and 2 to 5 wm 
thick separates the active layer from the semi-insulating (Cr-doped) 
substrate. Except in a few cases, no passivation layers were present on 
the finished chip. The chip size is 0.5 mm square and 50 um thick. Each 
chip is bonded in a 2.5-mm square package which is hermetically 
sealable. 

Ill. EXPERIMENTAL METHOD 


To accelerate the degradation or failure of the GaAs FETs, a number 
of methods were used. The primary of these (about 1,000,000 device- 
hours) was aging at elevated temperatures, both with and without bias. 
The ambient was air at temperatures of 88°, 180°, 220°, 250°, and 275°C. 
The bias duplicated normal operating values consisting of 5V on the 
drain and a gate bias of —0.1 to —2.V, as necessary to produce 15 mA of 
drain current.? At this bias, the elevation of the channel temperature 
above ambient is estimated to be about 8°C. Due to the wide-band in- 
stability of GaAs FETs, RF oscillations will readily occur even at high 
temperatures. Such oscillations are in themselves sometimes destructive 
and they may also produce instantaneous, or by rectification, dc bias 
values of unknown and uncontrolled magnitudes. Thus considerable 
effort was devoted to the suppression of oscillations by various means. 
Dissipative media (Eccosorb), RC networks, and ferrite beads were 
employed, with various degrees of effectiveness. One principal difficulty 
was the incompatibility of some of the stabilizing components with the 
high temperatures involved and the fact that it is desirable to place such 
stabilization as near the FET as possible. Ferrite beads were the most 
effective and usually succeeded in quelling oscillation. Zener diodes were 
also employed in both the drain and gate supplies to protect the FET from 
destructive voltage transients. 
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While aging units under bias, the dc bias values could be monitored 
and were recorded daily. Catastrophic failures, that is, short or open 
circuits in the drain or gate, were thereby readily observed. RF properties 
could only be determined by periodic removal of the units and testing 
in either a tunable or a fixed-tuned amplifier. Thus, at intervals which 
ranged from 100 to 1000 hours, groups of FETs were temporarily removed 
from the aging environment and dc and RF measurements were per- 
formed. The dc characterization consisted of photographing the output 
characteristics from which the saturated drain current, Ipss, and the 
low-field source-drain resistance, Rs, i.e., (dVps/dIps) at Vg = Vos = 
0, were determined. The RF parameters measured were the noise figure, 
NF, and the associated gain, G, at 4 GHz and 15 mA, 5V drain bias. In 
the case of the tunable amplifier used in the earlier stages of this study, 
the minimum NF was obtained; the NF obtained in the fixed-tuned 
amplifier (a modified Western Electric 652A”) was near-minimum, but 
not actually optimized for each device. 

Another failure-acceleration technique used was storage under bias 
in air of 85-percent relative humidity at 85°C (referred to hereafter as 
85/85). In these experiments (about 150,000 device-hours), only the gate 
was biased at —44 or —6V with respect to the grounded drain; the source 
floated. Some devices were aged without bias, of course, as was also the 
case at the higher temperatures. The reverse leakage and the continuity 
of the gate were checked hourly, then daily, and finally weekly in these 
experiments, which varied in duration from a few hours to a year. Peri- 
odic RF measurements were generally not performed on these devices 
since catastrophic failure due to electrolytic corrosion of the gate was 
the mechanism studied. The purpose of the 85/85 experiments was to 
determine the integrity of “hermetically sealed” packages, the presence 
of corrosive contaminants therein, and the effectiveness of various wa- 
terproofing or passivation coatings. 

To test the security of the thermocompression bonds of the 25-um 
diameter gold leads to the source, drain, and gate bonding pads as well 
as to test the hermetic seal, devices were cycled, under bias and without 
bias, between —40° and +125°C. The continuity of the bonds was tested 
before and after cycling as well as during cycling, in a few cases. Some 
devices were also thermal-shocked by alternate immersions in freezing 
and boiling water. These tests will not be discussed further, since in no 
case (out of 28,500 bond-cycles) was an open bond observed. In fact, no 
open bonds have been encountered among any of the over 1500 devices 
tested, before or after the various aging regimes described above. No 
centrifugal or vibration tests were employed in this program. 

Lastly, a few lots of FETs have been aged without acceleration—that 
is, under normal operating dc bias (no RF) at room temperature (27°C), 
totaling 250,000 device-hours. 
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iV. CATASTROPHIC FAILURES 
4.1 Au-Al phase formation 


The most common cause of complete dc and RF failures encountered 
_in this study was related to the formation of Au-Al compounds (a version 
of this on Si devices is known as purple plague). A dramatic example is 
shown in Fig. 4. In this case, the force of thermocompression bonding 
to the top Au layer has ruptured the integrity of the intervening Ti-Pt 
layer and caused contact between the Au top layer and wire and the 
bottom Al layer at the end of the gate structure. However, the loss of gate 
continuity is not due to embrittlement and subsequent parting of the 
bond nor to the high resistance of the Au-Al compound. Fed by the 
surplus of available Au, the Au-Al system (of which AusAls is the favored 
end product) acts like a sink for the surrounding Al and has produced 
voids in the Al gate structure (the Kirkendall effect). The voids in the 
gate are visible in Fig. 4. The presence of Ga may catalyze this reaction 





Fig. 4—-SEM photo of Au-Al phase at gate-bonding pad of early model FET after aging 
at 250°C with bias. 
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as does Si in the case of Si devices? (or other devices with SiOz layers). 
Figures 5 and 6 show other examples of Au-Al interaction leading to open 
gates. Note that the FETs of Figs. 5 and 6 employ the layout shown in Fig. 
2, in which the gate bonding pad is separated laterally from the Al 
structure. However, Au and Al still were able to interdiffuse due to a 
slight mask misalignment. Both the devices shown in Figs. 4 and 5 were 
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Fig. 5—Optical photos of Au-Al phase and consequent open gates in later model FETs 
after aging 144 hours at 250°C with bias. 
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Au-A® PHASE 





Fig. 6—Optical photo of Au-Al phase and consequent open gates in later model FET 
after aging at 144 hours at 250°C without bias. (This device is from a slice much more prone 
to Au-Al phase formation than the device of Fig. 5.) 


aged under bias. The FET of Fig. 6 was aged at the same temperature 
(250°C) without bias.* 

The detailed processes of this failure mechanism have not been fully 
unravelled, but a number of pertinent observations are summarized 
below. 

(i) Every FET that has failed due to an open gate (more than 100 have 
been examined) exhibits a Au-Al interaction site somewhere in the region 
where Au and Al overlap. This site may appear to be insignificantly 
small. 

(ii) The median time to failure due to open gates is 3 to 10 times longer 


* Even with perfect registration, Au-Al contact is expected to occur eventually due to 
diffusion through the Ti-Pt barrier, though no such cases have been observed so far in these 
experiments. A mask modification which eliminates the Au layer from the bridge between 
bonding pad and gate virtually prevents any Au-Al reaction. 


_1 The authors are indebted to A. T. English for his assistance in analyzing the inter- 
diffused gate structures. 
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among units aged without bias than among identical devices aged at the 
same temperature with bias. 

(iii) In all bias-aged failures, a void appears in the gate stripe just 
where it broadens. This is the point of maximum current density in the 
gate metallization. Voids may also (but do not always) appear in the 
broad Al area or at the mesa step. 

(iv) Failures among units aged without bias have voids scattered 
generally about the broad Al regions and frequently over the mesa step, 
but usually not at the aforementioned point of maximum current den- 
sity. 

It is apparent from these observations that there is a decided depen- 
dence on the presence of bias. It is important to note that, at 250°C, gate 
leakage currents at operating bias are one to two orders of magnitude 
larger than at room temperature, i.e., 20 to 200 vA instead of 1 to 5 pA. 
Even so, the maximum gate current densities during bias aging are cal- 
culated to be only 1 X 104 A/cm2. This value is generally considered 
“safe” with regard to electromigration at 250°C. Furthermore, no cor- 
relation is found between failure and the gate currents of individual units 
during aging. Thermal dissipation in biased units in 250°C ambient 
raises the channel temperature to approximately 258°C. However, un- 
biased units in 275°C ambient have a much lower incidence of open gates 
than the 258°C bias-aged units. Thus, this aspect of self-heating cannot 
explain the bias dependence. Also, joule self-heating within the gate 
stripe itself is calculated to cause less than 1°C temperature rise, which, 
of course, also fails to justify much electromigration at these apparently 
modest current densities. However, current densities in the gate struc- 
ture are not accurately calculable, since actual Al cross sections vary with 
the topography of the surface, especially at the mesa edge. It is known 
that electromigration is influenced by grain size and very little of the 
voluminous electromigration literature treats stripe widths as small as 
the 1-um gates involved here. Thus, electromigration in conjunction with 
Au-Al phase formation is tentatively thought to be responsible for gate 
failures in bias aging. Electromigration would transport Al down the 
stripe away from the bonding pad while Au-Al phase formation causes 
diffusion in the opposite sense. Perhaps electromigration inhibits Al 
atoms near the gate throat from replacing the atoms just downstream 
in the wider portion of the structure (where the current density is less) 
which are being drawn by diffusion toward the Au-Al compound. The 
formation of voids may be accelerated by this tug-of-war situation. 

If the above hypotheses regarding this gate failure mechanism are 
correct, the temperature dependence would be quite complex. The ac- 
tivation energy of Au-Al phase formation has been variously reported 
with values between 0.6 and 1.0 eV.4 (In any case, the interaction with 
Ga may alter these values.) Al electromigration (at constant current 
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density) is reported to have an activation energy of 0.5 to 0.7 eV.5 How- 
ever, as mentioned earlier, the gate leakage current itself is tempera- 
ture-sensitive. The latter two effects together, it is calculated, should 
give the electromigration an effective activation energy of 1.2 eV. 

The experimental situation, unfortunately, is not much clearer. On 
slices (of the type in Fig. 2) where Au-Al phase formation occurs, its 
occurrence is quite erratic, depending as it does on slight vagaries in 
alignment, lift-off, etching, and other details of pattern formation. Thus, 
among devices aged without bias, the median life (ML) varies greatly 
from slice to slice, though the activation energy observed is fairly con- 
sistent and near 1 eV. Electromigration varies as the second or third 
power of current density which, in turn, differs widely from one unit to 
another. However, no failures have been observed which appeared to be 
due to electromigration alone. When units are aged under bias at ele- 
vated temperature, both mechanisms are thought to be operative, though 
the degree of dominance by the one mechanism or the other probably 
varies both among devices and as a function of time during the course 
of void formation. Initially, phase formation is probably dominant, but 
when the cross-sectional area has been diminished enough and the local 
current density increases, electromigration becomes more important. 
In principle, it is inappropriate to use an activation energy to characterize 
this joint process consisting of two mechanisms. However, since both 
mechanisms are expected in this case to have an activation energy near 
1 eV, as described above, it is a useful approximation to apply an “acti- 
vation energy” to the combined effect. As expected, the experimental 
data are not entirely consistent, but are grouped about a value of 
1.0 eV. 

No typical ML can be cited for bias-aged devices, since many slices are 
entirely free of this mechanism. However, in the worst case, an ML of 94 
hours at 250°C with bias has been observed. It is important to note that 
this mechanism has been observed in the present study in devices bias- 
aged at temperatures as low as 180°C in times as short as 240 hours. 
Weaver and Brown detected Au-Al interdiffusion at 84°C in 3 hours.® 
Thus, Au-Al phase formation and subsequent destruction of GaAs FETs 
in which the choice of metallurgy and layout permits this combination 
cannot be dismissed as an exclusively high temperature phenomenon. 
However, an appropriate layout can completely eliminate the possibility 
of Au-Al phase formation. 


4.2 Electrolytic corrosion 


Figure 7 is an example of electrolytic gate corrosion. The corrosion 
shown was produced by a 2-hour exposure to an atmosphere of 85°C/85% 
RH with 6 V negative bias on the gate. The device was uncapped. This 
corrosion is clearly electrolytic, since in the absence of gate bias no sig- 
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Fig. 7—-Optical photograph of FET after 48 hours of aging, uncapped, in 85°C/85% RH 
humidity chamber, showing electrolytic corrosion of gate. 


nificant corrosion is observed. Electrolytic corrosion of unprotected Al 
structures is well known, of course, from reliability studies of silicon 
devices.” The unusually close electrode spacing and consequent high 
fields in GaAs FETs as well as the minuteness of the Al gates make them 
prime candidates for this failure mechanism, in humid conditions. The 
acceleration factors relative to both varying humidity and temperature 
have already been reported in the Si device literature®? and is summa- 
rized in Section VI. 

Various passivation or protective coatings have been proposed for the 
prevention of electrolytic corrosion in GaAs FETs. Schemes that only 
coat the GaAs, such as grown oxides, would not be expected to be ef- 
fective. The highly irregular topography of GaAs FETs complicates the 
task of achieving a continuous, impervious, pinhole-free, protective film. 
Equally important is the requirement that the film have small dielectric 
constant and low microwave loss; otherwise, the sacrifice in microwave 
performance is unacceptable. None of the films explored in this study 
fulfills all these specifications perfectly. 

A hermetically sealed package can provide permanent protection 
against electrolytic corrosion from external humidity and without any 
sacrifice in RF performance, at least at frequencies where a package can 
be tolerated. It is suspected, however, as observed already among Si 
devices,!° that residual impurities entrapped inside the package can 
produce destructive electrolytic corrosion, although the seal remains 
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intact. Water and chlorine are the chief offenders, and the trace amount 
of both which may be adsorbed on the package interior surface are ap- 
parently sufficient to produce corrosion. Patches of unremoved photo- 
resist may also harbor enough impurities to cause corrosion. Figure 8 
shows the corroded gate of a sealed device that failed after 240 hours 
under bias in 85/85 though no leak was detectable after removal from 
the chamber. A Krypton 85 radio-tracer technique was used for leak 
detection, which has a sensitivity in this case of 10~° std cm/s. Though 
a leak below the detectable limit cannot be excluded and might have 
caused the corrosion, the Si experience!° must be borne in mind and 
residual contamination suspected. This would be confirmed by the 
discovery of electrolytic corrosion in sealed devices aged at 80° to 90°C 
in dry air. Among the relatively few devices (30) aged in this manner in 
the present study, no corrosion has been observed. However, among a 
group of devices which had failed optical inspection due to unusually 





LS 


Fig. 8—SEM photograph of corroded gate of FET after 1816 hours of aging, sealed and 
leak-tight, in humidity chamber. Analysis reveals traces of Cl at corrosion site. 


nee 


2834 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


large amounts of photoresist remaining on the chip and which were 
sealed and aged in 85/85, the incidence of corroded gates was 50 percent 
in 2000 hours. Altogether, the incidence of electrolytic gate corrosion 
among “clean” devices which passed optical inspection and which also 
passed the leak test before and after aging has been about 1 percent in 
2000 hours. 

V. GRADUAL DEGRADATION MECHANISMS 

5.1 High-temperature effects 

One of the earliest GaAs FET degradation mechanisms to be discussed 
was an increase in contact resistance.!! According to well supported 
models,!2-13 Ga diffuses out of the crystal into the contact metallization 
at elevated temperatures. The resulting Ga vacancies probably act as 
acceptors, compensating the donors in the n-type lamina immediately 
adjacent to the contact, and thereby increase the contact resistance. The 
capacity of the metallization for absorbing Ga, or the effectiveness of 
an interposed barrier to the transport of the Ga, speed or inhibit the 
degradation process, respectively. In the ohmic contact structure de- 
scribed in Section II and illustrated in Fig. 3, the heavy final gold layer 
is the largest potential sink for migrating Ga, while the Ag and Ti-Pt 
layers act as an impeding barrier. As will be seen, however, other factors 
(such as the alloying cycle) must also play a role in the degradation of 
ohmic contacts. 

By means of special test patterns and an appropriate computer pro- 
gram, the contact resistivity, p., and the channel resistance of a gateless 
device, R(ch), were measured on certain FET slices before and after aging 
at 250°C, both with and without bias (0.3 A/cm, the same current per 
unit source width as in an operating device). A number of actual FETs 
from the same slice were also aged at the same temperature, with and 
without bias, and the usual parameters measured (Rs, Ings, NF, and G). 
Some slices (which will be designated Class I) showed virtually no change 
in any parameters after 500 hours, with or without bias, in either test 
patterns or actual FETs. Other slices (designated Class II), though 
nominally identical to Class I in design and fabrication, showed startling 
changes in certain dc parameters, as summarized for one slice in Table 
I. Typical values of Rs for unaged FETs were 15 to 30 ohms, of which the 
contact resistance contribution is only 0.2 to 0.5 ohms. The remainder 


Table | — Changes in various parameters after 500 hours of aging at 250°C 
for a Class Il slice 


Test Patterns Actual FETs 

Pe R(ch) Rs Ipss NF G 
With bias +5000% 0 +50% —16% 0 0 
Without bias +5000% 0 +20% —5% 0 0 
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of Rg is the resistance of the channel and of the semiconductor portions 
of the source and drain regions. Thus, it would appear plausible that a 
5000-percent increase in contact resistance, as shown in Table I, would 
cause an approximate doubling of Rs. However, it is noted that the 
change in p, was not affected by bias, whereas the change in Rg and Ipss 
was very much bias dependent. (The latter bias dependence is also seen 
in Fig. 9.) The change in Rg (and corresponding change in Ipgs) is 
therefore not wholly attributable to contact deterioration. 

The origin of the bias-dependent component of Rg has not been de- 
termined. One possibility is recombination enhanced defect formation,4 
though the hole production at the drain and under the gate seems too 
small for this effect. It is also suggested that the bias-dependent degra- 
dation may be related to the gate, since the test patterns, which were 
unaffected by bias, have no gates. It is also not understood what the es- 
sential difference is between Class I and Class II slices—and the con- 
tinuous spectrum of behavior between these two extremes. It is thought 
that the least-controlled processing step may be the contact alloying, 
which is therefore tentatively blamed for at least a part of the slice-to- 
slice variation in aging behavior. Fortunately, perhaps because the de- 
graded dc qualities of contacts are capacitively bypassed by RF signals,1© 
these wide fluctuations in the degradation of dc parameters are not re- 
flected in the RF performance. 

Figure 9 shows the average values of Rs, Ipss, NF, and G for two 
groups of FETs from the same slice (a Class II slice) aged at 250°C, one 
group with and the other without bias. Though both the de and RF 
characteristics degrade faster with bias than without, it is seen that, while 
Rg doubles, the noise figure and gain only deteriorate by 0.2 to 0.5 dB. 
Another example is shown in Fig. 10, where NF and G degrade only 0.2 
and 0.3 dB, respectively, while again Rs doubles. (The actual contact 
resistance increased 50-fold.) Two other interesting cases, both repre- 
sentative of many, are shown in Figs. 11 and 12. The devices of Fig. 11 
suffered only negligible changes in Rs, Ipss, and NF after 1300 hours 
of bias-aging at 250°C, though the gain declined about 0.6 dB. Figure 
12 shows the data from a group of devices in which none of the measured 
dc or RF parameters changed significantly in 1500 hours of bias-aging. 
The results presented in Figs. 9 through 12 may be summarized as fol- 
lows: 

(i) There are significant differences among slices in the way the dc 
and RF characteristics change upon aging. 

(it) Radical deterioration of contact resistance (5000 percent) or of 
source-drain resistance (100 percent) are accompanied by only minor 
degradation of RF performance; conversely, NF and G may degrade 
slightly, even though Rg remains constant. 

(tii) The median life at 250°C under bias, where failure is defined as 
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250°C WITH BIAS 
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Fig. 9—Plot of median Rs, Ipss, NF, and associated gain as a function of aging time | 
at 250°C for two groups of GaAs FETs from slice (1); Group A aged with bias and Group 
B aged without bias. (Note acceleration due to bias.) 
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Fig. 10—Plot of median Rs, Ipss, NF, and associated gain of a group of GaAs FETs from 
slice (2) as a function of aging time at 250°C with bias. (Note NF and G degrade only 0.2 
dB, though Rg increases 90 percent.) 


an NF or G degradation of equal to or more than 0.2 or 0.8 dB, respec- 
tively, is at least 1500 hours. 

The observed activation energy of RF degradation is 0.8 to 1.0 eV. It 
may be noted that many diffusion phenomena within or on the surface 
of semiconductors, such as might produce traps or scattering centers, 
have activation energies near 1.0 eV. 
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Fig. 11—Plot of median Rs, Ipss, NF, and associated gain of a group of GaAs FETs from 
slice (3), as a function of aging time at 250°C with bias. (Note NF and G degrade slightly, 
though Rs in this case remains unchanged.) 
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Fig. 12—Plot of median Rs, Ipss, NF, and associated gain of a group of GaAs FETs from 
slice (4) as a function of aging time at 250°C with bias. (Note that all measured properties 
remain essentially unchanged.) 


Vi. FAILURE STATISTICS AND UNOBSERVABLES 


6.71 Cumulative failure distributions 


The statistics obtained from an aging study depend to some extent 
upon the definition of failure. A definition of failure can be tailored to 
a specific failure mechanism and thus be used to sort out data that are 
relevant exclusively to that mode. Two definitions have been used in 
various stages of the present investigation. 

A. Catastrophic—collapse or radical change in dc output character- 
istics, usually due to a short or open circuit in one or more of the three 
electrodes. This definition is especially appropriate for study of the 
catastrophic mechanisms discussed in Section IV, but ignores any deg- 
radation of RF performance not associated with a large change in dc 
behavior. 

B. RF degradation, exclusively—requires that any units that suffer 
catastrophic failure be subtracted from the population and not counted 
in the statistics. Figures 9, 10, 11, and 12 were based on such a population. 
The degree of permitted RF deterioration should be set with system re- 
quirements in mind. In this study, unless otherwise specified, a device 
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was considered to have failed, RF-wise, if the noise figure increased:0.2 
dB or more, or the associated gain changed (up or down) by 0.8 dB. or 
more as measured at 4 GHz in a fixed-tuned.amplifier.. 

Figure 13 is a log-normal plot of the cumulative percent failures of type 
B as a function of aging time for 31 devices representing four separate 
slices:. The aging was performed with bias at 250°C air ambient.. The 
channel temperature is estimated to be 8°C warmer..A few of the devices 
were sealed, but the majority were not. No difference has been observed 
in the aging behavior of sealed versus unsealed FETs at this temperature... 
The data are seen to fit reasonably a straight line, making allowance for 
the statistical vagaries of small samples, which means they approximate 
a log-normal distribution. The standard deviation estimate,.s, of the line 
is about 1.3, obtained from the operational calculation 

s = In[t(50)/t(16)], 

where it. is noted the natural logarithm is used. and ¢(50) and t(16) are 
the times corresponding to 50 and. 16 percent cumulative failure, re-- 
spectively. The median life of this group is about 1700 hours.. The results 
shown in Fig: 13 are typical of the RF degradation. observed. in this. 
study.. 
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Fig: 18—-Log-normal plot of cumulative failure distribution (RF degradation) of a group 
of GaAs FETs aged with bias at 250°C. 
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Figure 14 shows the cumulative failure distributions at 250°C of two 
groups of units from one slice, one aged with bias and the other without 
bias. The MLs are about 100 hours and 350 hours, respectively, with 
nearly the same standard deviation of s = 1. The failures in this case are 
all type A and due to Au-Al phase formation, plus an apparent assist 
from electromigration in the biased group, as discussed in Section 4.1. 
This slice was unusually susceptible to the Au-Al phase problem and is 
chosen here to illustrate the failure statistics of that mechanism. 


6.2 Humidity acceleration factors 


Electrolytic corrosion is accelerated by increased humidity and tem- 
perature mainly as a result of and in proportion to the increased electrical 
conductivity of the surface. The problem has been most recently studied 
by Sbar and Kozakiewicz,? who give acceleration factors with respect 
to 85°C/85% RH for various encapsulations and temperature/humidity 
conditions. Though the absolute value of conductance on a GaAs surface 
may differ from that on a Si, Si3N4, or alumina surface, the temperature 
and humidity dependence are expected to be similar. For 60°C/5% RH 
(a choice which will be justified later), the Sbar-Kozakiewicz results 
indicate an acceleration factor of 2 X 10° with respect to 85/85. For a 
condition of 60°C/25% RH, the factor is about 10*. Both values apply to 
an unencapsulated device. For a perfectly sealed device, of course, the 
external humidity has no effect. The only acceleration of electrolytic 
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Fig. 14—Log-normal plot of cumulative failure distributions of two groups of GaAs FETs 
aged at 250°C with and without bias. (All failures were due to gate destruction by Au-Al 
phase formation plus, in one case, a bias-dependent factor.) 
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corrosion would be that due to the temperature elevation, assuming there 
are sufficient contaminants inside the package to produce an electrolyte, 
but that their release is not temperature-sensitive. The acceleration 
factor at 85°C relative to 60°C is approximately 3. 


6.3 Statistical error 


During the course of this study, numerous small changes have been 
made in the design or fabricational methods of the device under inves- 
tigation. Since it was desirable to appraise the reliability aspects of each 
of these variations and both facilities and device are limited in supply, 
the strategy has been to age many small lots of FETs rather than fewer 
and larger lots. The relatively small sample size (10 to 20) raises questions 
about the validity of the statistics obtained. It should be pointed out, 
therefore, that with a sample size of 10, one can be 90 percent confident 
that the true ML (i.e., the ML of an infinitely large sample) would be 
somewhere between 45 and 225 percent of the observed value, assuming 
a log-normal distribution with a standard deviation of 1. Thus, in the 
next section where failure rates are calculated, error tolerances may be 
attached to the values given by noting that in the area where most of the 
data fall, a factor of 2 in ML produces a factor of approximately 2 to 3 in 
failure rate. 


6.4 Possibility of unobserved failure mechanisms 


A relatively small number of FETs have been aged under bias for pe- 
riods of 0.5 to 1.5 years at moderate or room temperatures (more pre- 
cisely, 30 units at 180°C for 6000 hours, 10 units at 88°C for 4600 hours, 
and 30 units at 27°C for 8000 to 14,000 hours, all in air ambient). Judging 
from the statistics obtained at higher temperatures, provided that only 
the same mechanisms prevail, no failures or degradation should be ap- 
parent in these modest times at lower temperature, except possibly for 
the Au-Al phase-migration problem. Indeed, this turns out to be the case: 
five units of one lot of 10 at 180°C have failed due to the Au-Al syndrome, 
but otherwise no degradation appears in any of the units. 

The lack of any failures among the other units provides some lower 
bound to the activation energy of any failure mechanism which may be 
important at lower temperature but is obscured at higher temperatures 
by phenomena with higher activation energies. (This sort of insidious 
situation has been encountered in Pt-GaAs IMPATTs.!®) It will be as- 
sumed that such a failure mechanism would have a log-normal failure 
distribution with a standard deviation of 1. It is noted furthermore that 
the absence of a single failure in a lot of 30 indicates with 90 percent 
confidence that the true percent failure cannot exceed 10 percent. Fi- 
nally, it is noted that the ML of any such so-far-unobserved failure 
mechanism must be at least 1700 hours at 250°C, since that is the longest 


LOW-NOISE GaAs FETs 2841 


observediML at that temperature due to recognized mechanisms. Com- 
‘bining these-arguments leads to the conclusion that the minimum acti- 
‘vation energy of.a failure mechanism active at 180°C, but obscured at 
‘250°C is 0.7 eV. However, the minimum activation energy for a failure 
‘mechanism dominant at.27° to 180°C but not yet observed in this pro- 
‘gram is only 0:03 eV. The latter figure.is rather alarming. It means, in 
conjunction with the high temperature data, that if such a hypothetical 
‘mechanism exists, the projected ML :at room ‘temperature would be 
‘30,000 hours and the:ML would be only negligibly accelerated by elevated 
temperatures. The duration of the present reliability program is insuf- 
ficient to rule out such:a possibility. However, it is reassuring that other 
‘investigators have reported room temperature tests of low-noise GaAs 
FETs of similar metallurgy in excess of six years without :any fail- 
‘ures.!7 
6.5 Infant mortality 

Few instances of infant mortality have been found ‘in this study. There 
‘are two reasons: (i) rigorous optical inspection of all ‘chips before 
‘mounting and discarding of any units which appear mechanically or 
‘electrically defective after: mounting eliminate most devices that might 
otherwise be candidates for early failure; (ii) the small size of the samples 
‘cused further diminishes the probability of encountering ‘anomalous 
‘devices representing only a.small proportion of the population. Thus, 
ithis work sheds no light on the nature of such early failures except'that, 
‘with the present fabrication, inspection, and testing procedures, their 
‘occurrence is less than 1 percent. The failure and degradation mecha- 
‘nisms discussed here and the failure rates projected pertain ‘to:the:main 
‘body of the population. It must be anticipated that-some cases of infant 
umortality will:accompany large-scale production-and deployment:of this 
‘device. 
‘VIL ESTIMATION OF ‘FAILURE RATES 


‘Given ‘the nature of the failure distributions at:an elevated tempera- 
ture and their :respective activation ‘energies, ‘and making the all-im- 
iportant:assumption ‘thatithe:mechanisms studied:at elevated tempera- 
itture arealsothedominant ones:at room temperature, and with the fur- 
ther assumption ‘that the nature ofeach failure distribution 1s not item- 
sperature-dependent ‘(i-e., it:stays log-normal with ‘the‘same ss), ‘itremains 
only to specify the operating conditions in order towalculate ‘the probable 
failure ratesin tthe field. The:maximum ‘ambient ttemperature ina Bell 
‘System radio relay ‘application is 52°C (125°F). The corresponding 
mmaxinrum ‘channel temperature would be 60°C. "Though the :annual 
‘average temperature would certainly be-considerably dower, ‘60°C wall 
ibe taken as ithe channel ‘temperature for :calculation of failure rates. 
“Dhree separate cases will ‘be considered. 
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7.1 Casel: No catastrophic mechanisms 

In this case, it is assumed all fabricational steps have been faultless, 
assuring the absence of any contact between Au and Al, a contamina- 
tion-free chip and package interior, and a leak-tight seal. The cata- 
strophic failure mechanisms are therefore precluded, and only long-term 
degradation of RF properties is of concern. The median lifetime (type 
B) at 258°C was found to be about 1700 hours, and the associated acti- 
vation energy will be taken as 0.8 eV. Thus, the projected ML at 60°C 
would be 4 X 107 hours. Taking a standard deviation of 1.0 and using 
Goldthwaite’s curves,!® the failure rate after 20 years of service is found 
to be less than 107? FIT (1 FIT = 1 failure in 10° device-hours). It should 
be noted that, with a standard deviation of 1.5, the projected failure rate 
would be 2 FITs and with s = 2, the failure rate is 30 FITs, i.e., the dif- 
ference between s = 1 and s = 2 is more than 8 orders of magnitude in 
failure rate. Thus, an accurate knowledge of the standard deviation is 
vital to the accurate forecasting of failure rates. However, the low con- 
fidence levels of the statistics do not justify quibbling over the real value 
of s, and the predicted failure rates are small in any case (but do not 
include infant mortality). 


7.2 Case Il: Au-Al phase formation dominant 


As mentioned in Section 4.1, the ML due to Au-Al phase formation 
at 250°C has been observed to be as short as 94 hours, though it exceeds 
observation times in many cases. Based on this shortest observed ML 
and the smallest observed activation energy of 0.5 eV, a worst-case 
prediction is obtained, indicating that for an unscreened product the 
failure rate could go over 10,000 FITs, i.e., 1 percent per 1000 device- 
hours. However, a reliability qualification test of each slice can be used 
to assure that the ML due to the Au-Al/electromigration syndrome is no 
less than 500 hours at 250°C. Assuming a relatively conservative value 
of 0.8 eV for the activation energy (from the wide range observed of 0.5 
to 1.6 eV), an ML at 60°C of 1 X 10” hours is projected. Taking s = I, as 
found in Fig. 14, gives an estimated failure rate of 0.6 FIT in a 20-year 
service period. Taking s = 1.5, as found in occasional slices also domi- 
nated by the Au-Al failure mechanism, gives a failure prediction of 40 
FITs. The latter value is considered a realistic upper limit for devices of 
the type shown in Fig. 2 subjected to a reliability screening procedure 
(and is therefore the value quoted in the abstract). 


7.3 Case Ill: Electrolytic corrosion dominant 


If unsealed, unprotected low-noise GaAs FET chips were employed 
in an amplifier in which the housing was not hermetically sealed, elec- 
trolytic corrosion as described in Section 4.2 would be expected. Since 
in some radio relay applications, the waveguide is pressurized with 5- 
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percent RH air, this value for the ambient will be considered for the first 
example. (It should be noted that 30°C/25% RH air becomes 5% RH air 
when heated, at constant water vapor content, to 60°C). For unprotected 
units in 85/85, an ML of about 3.5 hours due to electrolytic corrosion has 
been observed. Using the appropriate acceleration factor of 2 X 105, as 
described in Section 6.2, an ML of 7 X 10° hours at 60°C/5% RH is pre- 
dicted. The corresponding failure rate is about 1000 FITs. For a second 
example, an atmosphere of 60°C/25% RH is chosen, which may be ob- 
tained by heating 32°C (90° F)/100% RH air up to 60°C. In this case, an 
ML of 8.5 X 104 hours and a failure rate of about 20,000 FITs after 5 years 
are predicted. 

As a third example, it might be assumed that the GaAs FET chip is 
unprotected and the amplifier is hermetically sealed, but not adequately 
free of contaminants. Indeed, in view of the large amount of surface 
within an amplifier and the difficulty of giving it a high-temperature 
vacuum bakeout, it very likely would contain dangerous amounts of 
residual impurities. An ML of about 2000 hours has been observed in this 
study with contaminated packages at 85°C. The acceleration factor 
relative to 60°C in this case is only 3, as discussed in Section 6.2. Thus, 
an ML of 6000 hours might be anticipated for this amplifier with un- 
sealed, unprotected FETs and a first-year failure rate of over 50,000 
FITs. 

It is emphasized that the above three examples of electrolytic corrosion 
assume unsealed, unprotected (unpassivated) devices. In the case of a 
clean, hermetically sealed device, electrolytic corrosion is effectively 
prevented, and no failures due to that mechanism are expected. 


Vill. CONCLUSIONS 


Two catastrophic failure mechanisms were found in this study of 
low-noise GaAs FETs, not including voltage transients which are con- 
sidered primarily a problem of handling technique and circuit design. 
One of these mechanisms is Au-Al phase formation occurring at the 
junction of the Al gate and its Au bonding pad. This mechanism is en- 
hanced by bias through what appears to be electromigration, though 
positive evidence of the latter is lacking. In a worst case, this mechanism 
could give rise to failure rates as high as 10,000 FITs, though with ap- 
propriate slice screening, values in the neighborhood of 1 to 50 FITs ap- 
pear more likely. Proper design and fabrication methods can eliminate 
this mechanism entirely. The other catastrophic failure mechanism is 
electrolytic corrosion of the Al. In a humid environment or in a con- 
taminated package, failure rates again in the order of 10,000 FITs might 
be anticipated. However, hermetic sealing in a contaminant-free package 
eliminates this problem. It is noted that both these failure mechanisms 
are related to the choice of an Al gate. They are not peculiar to GaAs 
FETs, but are well known as causes of failure in Si devices. 
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In the absence of catastrophic failure, a long-term, gradual degradation 
of noise figure and gain is observed. This effect is only weakly correlated 
with increase of contact resistance and is apparently more strongly in- 
fluenced by other factors such as the formation of traps and scattering 
centers. The median lifetime due to this gradual RF degradation is es- 
timated to be over 10’ hours at a channel temperature of 60°C. The 
corresponding failure rate after 20 years of service is less than 2 FITs. 

All the important failure modes were accelerated by the presence of 
drain and gate bias. Aging without bias would give erroneously optimistic 
predictions. 


IX. ACKNOWLEDGMENTS 


The authors are indebted to many colleagues whose contributions 
significantly aided this work. Special thanks are due to J. P. Beccone, 
W. L. Boughton, J. V. DiLorenzo, A. T. English, D. E. Iglesias, L. C. 
Luther, F. M. Magalhaes, W. C. Niehaus, Mrs. Y. C. Nielsen, R. H. Saul, 
and W. O. Schlosser. 


REFERENCES 


1. B.S. Hewitt, H. M. Cox, H. Fukui, J. V. DiLorenzo, W. O. Schlosser, and D. E. Iglesias, 
“Low Noise GaAs MESFETs: Fabrication and Performance,” 1977 GaAs and Related 
Compounds (Edinburgh), 1976 (Inst. Phys. Conf. Ser. 33a), p. 246. 

. R.H. Knerr and C. B. Swan, “A Low-Noise GaAs FET Amplifier for 4 GHz Radio,” 
B.S.T.J., 57, No. 3 (March 1978), p. 479. 

. B. Selikson, “Failure Mechanisms in Integrated Circuit Interconnect Systems,” 6th 
Annual Proc. Rel. Phys. Symp. (IEEE) (1968), p. 201. 

. E. Philofsky, “Purple Plague Revisited,” Solid-State Electronics, 13 (October 1970), 
p. 1391. 

. I. A. Blech and E. S. Meieran, “Electromigration in Thin Al Films,” J. Appl. Phys., 
40 (February 1969), p. 485. 

. C. Oe L. C. Brown, “Diffusion in Evaporated Films of Au-Al,” The Phil. Mag., 
7 (1961), p. 1. 

. B. Reich and E. B. Hamkin, “Environmental Factors Governing Field Reliability of 
Plastic Transistors and Integrated Circuits,” 10th Annual Proc. Rel. Phys. Symp. 
(IEEE) (1972), p. 82. 

. D.S. Peck and C. H. Zierdt, Jr., “Temperature-Humidity Acceleration of Metal- 
Electrolysis Failure in Semiconductor Devices,” 11th Annual Proc. Rel. Phys. Symp. 
(IEEE) (1973), p. 149. 

. N.L. Sbar and R. P. Kozakiewicz, ““New Accleration Factors for Temperature, Hu- 
rey’ Bias Testing,” to appear in 16th Annual Proc. Rel. Phys. Symp. (IEEE) 

10. A. Shumka and R. R. Piety, “Migrated-Gold Resistive Shorts in Microcircuits,” 13th 
Annual Proc. Rel. Phys. Symp. (IEEE) (1975), p. 93. 

11. T. Irie, I. Nagasako, H. Kohza, and K. Sekido, “Reliability Study of GaAs MESFETs,” 
IEKE Trans. on Microwave Th. and Tech., MTT-24 (June 1976), p. 321. 

12. K. Ohata and M. Ogawa, “Degradation of Au-Ge Ohmic Contact to n-GaAs,” 12th 
Annual Proc. Rel. Phys. Symp. (IEEE) (1974), p. 278. 

13. A. Christou and K. Sleger, “Precipitation and Solid Phase Formation in Au(Ag)/Ge 
Based Ohmic Contacts for'GaAs FETS,” 6th Biennial Conf. on Active Microwave 
Semiconductor Devices and Circuits, Cornell, 1977. 

14. L. C. Kimerling, “New Developments in Defect Studies in Semiconductors,” IEEE 
Trans. on Nuclear Sci., NS-23 (1976), p. 1497. 

15. J.C. Irvin and R. L. Pritchett, ““Nonohmic Contacts for Microwave Devices,” Proc. 
IEEE (Corres.), 58 (November 1970), p. 1845. 

16. W. C. Ballamy and L. C. Kimerling, “Premature Failure in Pt-GaAs IMPATTS-Rec- 
ombination Assisted Diffusion as a Failure Mechanism,” Tech. Digest IEDM (IEEE) 
(1977), pp. 90-92. 


q1 D> oO & WD Pb 


[oe 2) 


ite} 


LOW-NOISE GaAs FETs 2845 


17. D. A. Abbott and J. A. Turner, “Some Aspects of GaAs MESFET Reliability,” IEEE 
Trans. on Microwave Th. and Tech. M-24 (June 1976), p. 317. 

18. L. R. Goldthwaite, “Failure Rate Study for the Log-Normal Lifetime Model,” Proc. 
7th Nat’l. Symp. on Reliability and Quality Control, 208 (January 1961). [This curve 
was reprinted in the 9th Annual Proc. Rel. Phys. Symp. (IEEE) (1971), p. 78]. 


2846 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


Copyright © 1978 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 57, No. 8, October 1978 
Printed in U.S.A. 


Estimation of Point-to-Point Telephone Traffic 
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Estimates of point-to-point telephone traffic are required for the 
current and the long-range planning of the Bell System’s Public 
Switched Network. Because of the potentially immense volume of data 
which must be processed, these estimates are typically based upon 
small samples of total traffic and, therefore, can have large statistical 
errors. In this paper, we develop a model for quantifying the accuracy 
of point-to-point traffic measurements as a function of sample size and 
traffic parameters. Together with a worth-of-data model, not described 
here, our results can be used to establish a cost-optimal sampling rate 
for point-to-point traffic measurement systems. However, our results 
have been used to establish 20 percent as an upper bound on a cost- 
optimal sampling rate for a usage measurement system and 10 percent 
for an attempt-only measurement system. We show, however, that the 
attempt-based estimate is, for sampling rates greater than about 2 
percent, less accurate than the usage-based estimate. We also show how 
the accuracy of point-to-point load estimates can be improved by 
employing a ratio-estimate which combines point-to-point and 
trunk-group measurements; however, in practical applications, we find 
that the improvement is not significant. 


1. INTRODUCTION 


‘Trunk-group and point-to-point traffic data systems provide the 
measurements of telephone traffic which are used for the current and 
the long-range planning of the Bell System’s Public Switched Network. 
‘Trunk-group data systems provide estimates of the traffic offered to 
existing trunk groups. Normally, an estimate of trunk-group offered load 
is ‘based upon a direct measurement of the average number of busy 
trunks, the average attempt count, and the average overflow count.! 

Point-to-point traffic data systems provide estimates of the telephone 
‘traffic :which originates at one and terminates at the other of a specific 
‘pair .of network points not necessarily joined by a single trunk group; for 
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example, the end-office pair (A;,B,) of Fig. 1. In the trunk-provisioning 
process, estimates of point-to-point offered loads are required to plan 
for the introduction of new trunk groups and the rehoming of end-offices 
or tandems. In general, they are also used, as a supplement to trunk- 
group measurements, in the network disassembly process (the process 
that converts measured loads on trunk groups which receive overflow 
traffic to first-route loads) and in the network assembly process (the 
process which converts projected first-route loads to total offered loads). 
Moreover, with the possible introduction of dynamic traffic routing, our 
studies have shown that the trunk-provisioning process will require more 
extensive use of point-to-point data than is required in the present 
hierarchical fixed-routing network. 

Estimates of point-to-point offered loads cannot, in general, be derived 
from trunk group measurements since trunk groups typically carry more 
than one point-to-point load. Instead, estimates of point-to-point offered 
loads are derived from detailed records of the origin, destination, and, 
when available, holding times of individual calls. (When holding times 
are not available, a load estimate can be based upon an attempt count 
measurement together with an exogenous estimate of mean holding time; 
see Section 3.2.) 

To reduce the costs for recording and processing point-to-point data, 
most existing measurement systems have been designed to record only 
a small sample of total traffic. For example, the Centralized Message 
Data System (CMDS, see Section II) provides estimates of point-to-point 
loads derived from a 5-percent sample of all toll calls. But while sampling 
reduces the cost of providing point-to-point data, it also introduces 
statistical measurement errors that reduce the accuracy and, hence, the 
worth of the data. 






TANDEM 


Fig. 1—An application of point-to-point data: planning new trunk groups. 
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In this paper, we develop a model for quantifying the accuracy of 
point-to-point traffic measurements as a function of sample size and 
traffic parameters. Together with a worth-of-data model? for quantifying 
the cost impact of data errors on the network provisioning process, this 
data accuracy model can be used to establish the trunk-engineering re- 
quirements for point-to-point data systems. 

In Section II, we describe how point-to-point loads are measured by 
CMDS and we develop a model for quantifying sampling error. While this 
is a specific example, the methods and results are directly applicable to 
other (existing and proposed) point-to-point traffic measurement sys- 
tems. In Section III we use our model to analyze three methods for es- 
timating point-to-point loads: one based upon a usage measurement, 
one upon an attempt count together with an exogenous estimate of 
holding time, and one upon a combination of point-to-point and 
trunk-group measurements. A summary is given in the last section, and 
the required statistical results are developed in Appendices A and B. 


Il. POINT-TO-POINT MEASUREMENTS 


For toll traffic, the major source of point-to-point data is provided by 
the Centralized Message Data System. In this section, we describe the 
CMDs data base and model the various sources of error. 


2.1 The CMDS data base 


For every point-to-point traffic item (defined by originating and 
terminating end-office prefix codes), the CMDS data base provides an 
estimate of both the total number of calls and the associated usage (i.e., 
sum of holding times) for calls that originate during a time-consistent 
hour over 20 consecutive business days. 

These estimates are based upon a 5-percent sample of the total 
number of calls processed by the toll billing equipment in each Regional 
Accounting Office (RAO). Figure 2 illustrates the process. Automatic 
Message Accounting (AMA) tapes are periodically shipped to a Regional 
Accounting Office where they are processed to produce sequential rec- 
ords of the origin, destination, and conversation time of individual calls. 
As these records are processed for customer billing, the record for every 
20th call is transmitted (in a batch mode) to the CMDS computer in 
Kansas City, where they are sorted and summarized to provide estimates 
of individual point-to-point loads. . 


- 2.2 Sources of error 


Since estimates of point-to-point offered loads are based upon mea- 
surements made over several time-consistent hours, and since source 
loads are known to vary from day to day, our model will account for 
statistical errors due to both the finite measurement interval and day- 
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Fig. 2—Centralized message data system. 


to-day load variation:? Furthermore, since the measurements are ob- 
tained from a 5-percent.sample of total traffic, our model will also ac- 
count for variations in the sample.size for individual point-pairs. That 
is, depending upon the position of calls in the sequence of message rec- 
ords (from which the 5-percent sample is obtained) the actual sample 
‘size for an individual point-pair can be more, or less, than 5 percent. In 
Section ‘2.3, we develop a model for quantifying these sources of error. 

“The CMDS data base excludes toll traffic which is not billed. In addi- 
tion, of course, to blocked calls, CMDS also excludes call set-up and 
ringing time (for both completed and noncompleted calls), directory 
assistance calls, and official calls whichare not detailed billed. Estimates 
.of this nonbilled usage-are, therefore, an additional source of error for 
‘CMDS-based load estimates. However, our:studies have shown that this 
error-is negligible in comparison with sampling error and, therefore, it 
‘will not be accounted for by our:model. (Section 3.2 describes a method 
for estimating nonbilled:usage.) 


2.3 ‘Mathematical model 


Estimates of point-to-point offered Joads:are normally based upon 
‘measurements made:over -K disjoint ‘time-consistent intervals J,,. . Jk, 
each of length t (typically,:K =20:and t = 1-hour). We assume that the 
‘distribution of realized loads can be:described by the model used by Hill 
‘and Neal? to explain the observed variation.of trunk-group offered loads. 
"Thus, during J;, we:assume that-call:arrivals-are Poisson-distributed* 
‘with rate :Aj:and thatcall-holding times are independent and exponen- 


“* Roint-to-point:offered loads correspond ‘toitrunk-group first-offered (Poisson) loads; 
thence, ‘itiis:appropriate itoset ithe peakedness ‘factor, z,of Ref.:3 to.unity. 
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tially distributed with mean h. Furthermore, in accordance with the 
model for day-to-day load variation developed in Ref. 3, the loads a; = 
Ajh, i = 1,...,K, are assumed to be independent and identically dis- 
tributed with mean a = Ah and variance 


2a 

= FS petemeeniets 
Ug max|0, 0.13a a (1) 
where ¢ is a parameter that describes the level of day-to-day variation. 
For engineering applications, we use ¢ = 1.5, 1.7, or 1.84, which are re- 
ferred to, respectively, as low, medium, or high day-to-day variation. For 
first-routed and point-to-point traffic, ¢ = 1.5 is usually appropriate. 

To model the sampling process, we assume that in the sequence of 
message records, each call associated with a given point-pair is included 
in the sample with the same probability p (for CMDS, p = 0.05); i.e., we 
assume a multinomial distribution for the numbers of sampled calls 
belonging to given point-pairs. (For CMDS, the actual distribution is more 
closely approximated by a hypergeometric distribution; however, since 
the number of calls belonging to a given point-pair is a small fraction of 
the total number of calls processed by an RAO, our simplifying as- 
sumption introduces no significant loss of accuracy.) 

Let N; denote the number of arrivals during J; and let hj; be the 
holding time of the ith arrival in J;. Then, with 6;; = 1 if the ith call is 
included in the sample and zero otherwise, 


K Nj 
Cm DD iF (2) 
jJ=1i=1 
is the total number of sampled calls during J = pa Tj, and 
K Nj 
u= 2 : ligoe (3) 
jJ=11=1 


is the corresponding usage. 


lll. LOAD ESTIMATES 


In this section, we analyze three procedures for estimating point- 
to-point loads. The first estimate, d“, is based upon the usage mea-. 
surement, u; the second estimate, 4), upon the attempt count, c; and 
the third estimate, 4G), upon a combination of point-to-point. and 
trunk-group measurements. (Although these do not exhaust the possible 
estimates, they do form the basis for analyzing more complex estimates; 
for example, an estimate of the offered load at 10 a.m. could be based 
upon a combination of the measured loads at 9, 10, and 11 a.m.) In each 
case, we use mean square error (MSE) to measure the accuracy of the load 
estimate, i.e., if d denotes an estimate of the mean offered load, then. 


MSE{d} = E{a — a}? 
= Var{é} + E4{a — a}. (4) 
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3.1 Estimate 1 


Since u [eq. (3)] is a p-sample of usage over K intervals each of length 
t, 


aw= mie (5) 


is an estimate of the corresponding average offered load a. 
In Appendix A, we show that 4) is unbiased, i.e., 





E{é} = a, (6) 
and has variance 
1, 2a 
Varia} = — Ugh: 7 
aria) Pan v4| (7) 
hence, from eq. (4), 
1 2a 
MSE{é ()} = — | —— + |. 8 
{4a} K ea Vd (8) 


In (8), the first term {2a/pt/h} represents the combined effects of the 
finite measurement interval and deviations from the average sample size. 
The second term {vg} is due to (day-to-day) variations in the source load. 
Of course, the factor K is due to averaging measurements over K inde- 
pendent intervals. 

Figure 3 displays the root-mean-square (RMS) error of dé (in percent 
of mean load) as a function of average offered load for sampling rates 
of 5 and 100 percent. The results for a 5-percent sample apply when the 
offered load is estimated using CMDS data, while those for a 100-percent 
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Fig. 3—Sampling error vs offered load. 
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sample apply when the load estimate is based directly upon trunk-group 
measurements. As noted in Section 2.2, errors in estimates of nonbilled 
usage, associated with CMDS estimates, are negligible in comparison with 
sampling error. Also, in applying our results to trunk-group measure- 
ments, we assume that u [eq. (3)] adequately approximates the actual 
usage during the measurement interval J; i.e., we assume that the edge 
effects are negligible (see Ref. 3). Furthermore, our studies have shown 
that the additional variance caused by discretely sampling the usage with 
a 100-second-scan Traffic Usage Recorder is negligible when compared 
with the variance caused by day-to-day load variation. The results shown 
in Fig. 3 assume the standard measurement interval (¢ = 1 hour, K = 20), 
low day-to-day variation (¢ = 1.5), and h = 250 seconds. 

Note that estimates based upon a 5-percent sample can have errors 
that are large relative to those based upon a 100-percent sample. For 
example, for an offered load of 5 erlangs (typical of base year prove-in 
loads for new high-usage trunk groups), the RMS error for a 5-percent 
sample is about 20 percent, compared with an RMS error of about 5 
percent for a 100-percent sample. Similarly, for an offered load of about 
15 erlangs (typical of loads offered to existing Long Lines high-usage 
trunk groups), an estimate based upon a 5-percent sample has an RMS 
error of about 12 percent, while for a 100-percent sample, the RMS error 
is about 4 percent. 

Figure 4 displays the percent RMS error of 4") as a function of the 
sampling rate p for offered loads of 5 and 15 erlangs. The important 
result to note is that the statistical variability of d™ does not decrease 
appreciably as the sampling rate is increased beyond about 20 percent. 
This occurs since the contribution of day-to-day load variation is inde- 
pendent of the sampling rate, and above a sampling rate of about 20 
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Fig. 4—Sampling error vs sampling rate. 
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percent it becomes the dominant source of error. Of course, any im- 
provement in accuracy is significant if the associated benefits justify the 
increased cost for data collection and processing. However, using the 
worth-of-data model of Ref. 2, which quantifies the cost impact of data 
errors on the provisioning of direct final (i.e., nonalternate route) trunk 
groups, we have established 20 percent as an upper bound on a cost- 
optimal sampling rate. To establish an actual cost-optimal sampling rate, 
however, we require a worth-of-data model which applies to more general 
network configurations (i.e., alternate routing networks); such a model 
is currently being formulated. 


3.2 Estimate 2 


The usage-based estimate of offered load (i.e., Estimate 1) is derived 
from attempt count and holding-time measurements. In this section, 
we analyze an alternative estimate based upon an attempt count together 
with an exogenous estimate of the corresponding mean holding time. The 
data collection and processing costs for an attempt-based point-to-point 
data system are less than for a usage-based system; however, we will show 
that the load estimates are substantially less accurate. 

Let fh (a constant) denote an estimate of the mean holding time h. 
Then, since c [eq. (2)] is the total number of sampled calls during I, c/Kpt 
is an estimate of the mean attempt rate \ and, therefore, 


1 
42) = —— of g 
. Kpt © (9) 


is an estimate of the average offered load a. 
In Appendix A we show that 








E{é2} = dh (10) 
so that dG) is biased whenever h + h, and 
vous) E fern an 
hence, from (4), 
MSE{é2)} = ()' ral = — 4 va} + d2(h — h)2. (12) 
h/ K \(pt/h 


Clearly, MSE{4 (2)} depends upon the error (A — h). In practice, the 
same estimate f would be applied to a collection of point-pairs (e.g., all 
point-pairs within an operating company, or all point-pairs served by 
a common trunk group), and our studies have found that the corre- 
sponding distribution of errors (A — h) has a coefficient of variation of 
at least 20 percent. Accordingly, our numerical results will assume that. 
h is in error by 20 percent. 
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Figure 5 displays the percent RMS error of 4‘) as a function of 
- sampling rate for an offered load of 5 erlangs. We assume the same nu- 
merical values for K, t, ¢, and has in Fig. 3, and we assume h/h = 1.2. 
For purposes of comparison, Fig. 5 also displays the percent RMS error 
of 4), as given previously in Fig. 4. 

We draw two conclusions from the results shown in Fig. 5. First, 
‘whereas 20 percent is a reasonable upper bound on sampling rate for a 
usage-based measurement system, a sampling rate of about 10 percent 
is sufficient for an attempt-based system. Of course, if h were known to 
be in error by more (less) than 20 percent, a sampling rate of less (more) 
than 10 percent would be appropriate. But if we know only the coefficient 
of variation of the distribution of h (which we assume to be 20 percent), 
then the average value of MSE{4)}, with respect to this distribution, 
cannot be significantly reduced by increasing the sampling rate beyond 
10:percent. Second, we note that an estimate based upon measured usage 
‘is, for sampling rates greater than about 2 percent, more accurate than 
-an estimate based upon an attempt count. (For sampling rates less than 
2percent, the standard deviation of the measured holding time exceeds 
that of the estimate A; hence, é©) is relatively more accurate in this 
range.) 

In-view of the above results, we conclude that usage measurements 
(when available) are preferable to attempt counts for estimating 
point-to-point loads. However, our studies have shown that the attempt 
count ‘provides a more accurate basis for estimating CMDS nonbilled 
‘usage than does the measured (billed) usage. That is, with CMDS data, 
‘an estimate of the form d= (u-+ Bc)/Kpt is employed, where the first 
term {u/Kpt} is an estimate of billed load and the second term {8c/Kpt} 
is.an estimate of nonbilled load. Thus, 8 can be interpreted as an estimate 
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‘Fig. 5—Comparison of attempt-based ‘and usage-based .offered-load:estimates (offered 
‘load = 5-erlangs). 
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of an average nonbilled holding time per billed attempt. Furthermore, 
we have shown that a small additional improvement can be obtained by 
employing a load-dependent combination of u and c to estimate billed 
load. However, this additional improvement is not significant. 


3.3 Estimate 3 


In this section, we show how the statistical variability of point-to-point 
load estimates can be reduced by combining point-to-point and trunk- 
group measurements. The procedure we describe has been proposed as 
a means for improving the accuracy of CMDS-based load estimates; 
however, we show that the improvement is not significant. 

Consider a trunk group whose total offered load is the sum of N 
point-to-point first-offered loads. For the ith load, i = 1,...,N, let a; 
denote the mean load and let 4!be the (point-to-point) usage-based 
estimate of a;. Furthermore, let T denote the estimate of trunk-group 
offered load based upon trunk-group usage measurements and let A = 
=, a!) denote the corresponding estimate (for the same measurement 
interval) based upon point-to-point usage data. Since T is based upon 
a 100-percent sample, the difference (7' — A) measures the sum of the 
errors relative to the realized loads in the individual estimates d!”. By 
assigning a fraction (w;) of this difference to the individual estimates 
a!) we obtain a new estimate of a; i.e., 


a = 4) + w,(T — A). (13) 
In Appendix B, we show that an approximation to a minimum-vari- 


ance linear estimate of a; is obtained when w; = 4!!/A. Thus, we have 
the ratio-estimate 





‘ST. (14) 


Since a! appears as a summand in A, the ratio T/A is negatively cor- 
related (or tends to vary inversely) with d!?. Physically, it is this negative 
correlation which makes 4!) statistically less variable than d!. 

By employing a first-order Taylor series approximation to 4!°), we 
obtain in Appendix A the following approximations for the mean and 
variance of 4!°): 








E}a}?} © a; (15) 
and 
1 
Var{a{?)} ~ a ; a (L— fill — py + vai (16) 
where 
a; 
fi= . (17) 
yeas) 
j=l 


2856 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


is the fraction of the total load contributed by the ith point-pair. Also, 
since our results are not significantly affected by differences in mean 
holding times, we have assumed that each point-to-point load has the 
same mean holding-time, h. From (4), (15), and (16) we have 


1 
ear {1—f,(1 — p)}+ Kai: (18) 


Figure 6 displays the percent RMS error of 4°) as a function of offered 
load for several values of the parameter f;. We assume a sampling rate 
of 5 percent (the CMDS sampling rate) and the same numerical values 
for K, t, h, and ¢ as in Fig. 3. Note that dis more accurate than 4!” and 
that the relative difference in accuracy is a maximum when /; Saunls one 
(since a{ = T when f; = 1) and approaches zero as f; approaches zero 
(since the variance of T/A approaches zero and, hence, d!*) approaches 
a!” as f; approaches zero). 

The results of Fig. 6 are perhaps more striking when viewed in terms 
of the reciprocal of f;, which can be interpreted as the number (N’) of 
equal-sized point-to-point loads corresponding to f;. That is, the relative 
difference in accuracy of d!” and 4!*) is a rapidly decreasing function of 
N’; for N’ greater than 4, the relative difference is less than about 4 
percentage points. 

Typically, trunk groups carry a large number of point-to-point loads, 
each of which represents a small fraction (f; « 1) of total offered load. 
In this region, Fig. 6 shows that the use of trunk-group measurements 
provides only a small improvement in the quality of CMDS-based load 
estimates. Again, any improvement is significant if the associated ben- 
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Fig. 6—Reduction in rms error afforded by combining point-to-point and trunk-base 
measurements. 
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efits justify the increased development and data processing costs. 
However, based upon the worth-of-data model of Ref..2, we have con- 
cluded that employing trunk-group measurements will not significantly 
reduce the statistical errors associated with CMDS-based estimates of 
point-to-point loads. 


IV. SUMMARY AND CONCLUSIONS 

We have developed a model for quantifying the accuracy of point-. 
to-point traffic measurements as a function of sampling rate and traffic 
parameters. Using this model, we have established 20 percent as an upper 
bound on a cost-optimal sampling rate for a usage-based measurement 
system and 10 percent for an attempt-based system. Furthermore, for 
sampling rates greater than a few percent and loads in the range of en- 
gineering interest, our results show that a usage-based load estimate is. 
more accurate than an attempt-based load estimate. We also showed that 
the accuracy of (CMDS) load estimates could be improved by employing 
a ratio estimate that combines point-to-point and trunk-group mea- 
surements; however, in practical applications, the improvement is not 
significant. Our results, together with a worth-of-data model,” can be 
used to establish requirements for point-to-point traffic measurement 
systems. 


APPENDIX A 
Mean and Variance of Load Estimates 
A.1 Estimate 1 

From eqs. (3) and (5) and Ref. 4, 


Eta} = ae E [z{ > hijbsN| ; (19) 


Since the h;; and 6,;; are independent, and since arrivals during J; are 
Poisson-distributed with rate \;, we have 


ae 
E\E| y hidis|Nj] | = E{N;ph} 
= phE{E{Nj|Aj;}} 
= phE});t} 
= phot. (20) 
Substituting (20) into (19) gives 
E{aé} = dh 


="d. (21) 
Furthermore, since the measurements during each interval J; are 
uncorrelated, (3) and (5) give 
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N; 
Var{a()} = = (Kp ar > Var| 3. hijéij. (22) 


From Ref. 4, 
t Ny Nj 
Var| 3 hjdis} = E) Var| 5. yh hsdijlN3} 
N; 
2 Var|E| 5 hsdis1NG} | 
i=1 


= E{Njh2p(2 — p)} + Var{Njph} 
= \th2p(2 — p) + p*h?Var{N;}. (28) 


Again, given \,;, N; is Poisson-distributed; hence, Var{N;|A;} = E{N;| Aj} 
= Ajt. 
Thus it follows that 


Var{N;} = E{Var{N;|A;}} 
+ Var{E{N;| Aj} 
= E{Ajt} + Var{rjt} 
= dt + t?Var{Aj;}. (24) 
Substituting (24) into (23) gives 
N; 
Var| 3 hij6i| = 2\th2p + p2t2h2Varfy;}. (25) 
i=l 
The offered load during J; is aj = Ajh; hence, vg = Var{a;} = 


h2Var{Aj}. 
Thus, from (22) and (25), we have 





(26) 


Var{é)} = 2 mar ag va}. 


ah 
We now develop an expression, which we require in Section A.3, for 


Covid™, dM | =a}, 
where 
GM |p =F dD hi (27) 


corresponds to a sampling rate of 100 percent. Thus, from (5) and (27), 
we have 


N; Nj 
Sos 2X Corl Y hy dip Dh il; (28) 


From Ref. 4, 


Cov| 5 hjj6ij, x his} = B|Cov| si hi6ij|Nj, sl hislNj|} 
i=1 i=1 i=1 iat 
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Nj Nj 
+ Cov] | oy hisdislNjl, B| 5. hi lj} }. (29) 
1=1 i=1 
We first expand 
Nj Nj 
Cov] hi5ij|Nj, = halNj| 
B| (> hijdij|Nj) (z his\Nj)} -E| > hisdisNj} E i hs IN| 


=1 
= N;{p2h2 + (N; — 1)ph?}- N?ph? 
= jph?. (30) 
Substituting (380) into (29) and using (24) gives 


N; N; 
Cov s! hgdy, hs} = ph2E{N;} + Cov{N,ph, Nh} 
i=1 1=1 


= ph2E{N;} + ph?2Var{N;} 
= 2phat + ptug. (31) 
Thus, from (28) and (31), we have 
: ‘ 1 (2a 
Covia®, dM | ,-y} = K Fa + va}. (32) 


A.2 Estimate 2 


Using expansions similar to those of A.1 it follows, from (2) and (9), 
that 


apy (33) 
and 
& (FY z(t tee (34) 


A.3 Estimate 3 


An approximation for the mean and variance of 4$* is obtained by 
expanding the right-hand side of eq. (14) in a three-dimensional Taylor 
series about the point {E{T}, E{A}, E{é!}}. To first order, this gives the 
approximations 

E{T} | 
Eja®) ~ 24 pig 35 
{a7} EIA} (a;"} (35) 
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and 
Var{a!)} ~ Varfa!)} + f?Var{A} 
+ f?Var{T} — 2f;Covia!, A} 
+ 2f;Covid!?, T} — 2f?Cov{A,T}, (36) 
where 
aj 


aa es (37) 
N 
2d. aj 
j=l sy 
Since the trunk-group measurement 7’ corresponds to a sampling rate 
of 100 percent (i.e., p = 1), we have 


T=A [p=1 
N 
=> a par (38) 
i=l 
Hence, from (21), (35), and (38), it follows that 
Eia)} © aj. (39) 


We assume that the daily source loads (for different point-pairs) are 
uncorrelated;* hence, the estimates d!” are uncorrelated. Furthermore, 
since our results are not significantly affected by differences in the mean 
holding times, we assume that each point-to-point load has the same 
mean holding time, h. Thus, we have 


. oN 
Var{A} = & Varia!) 





i=l 
1 N ( 2a; 
=— as eet 41 
Ki Fan va| 
Var{T} = Var{A| p=1} 
1 N (2a; 
=— —t + ty 42 
aa vai| es 


and 





1 ¢ 2a; 
=— + Ug;}. 43 
K F me | 9) 
Also, from (32) and (38), 


Cov{a!®, T} = Cov{a!?, 4! | pa} 


== Fer cs vai| (44) 


* We have shown that our results are independent of the covariance structure of the daily 
source loads; for simplicity, we assume that they are uncorrelated. 
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and 


LS: Bes rs vl (45) 


Combining (26), (36), and (41) through (45) gives 


1 ( 2a; 2(1— Pp) »» sAUl=p) 
Varia!) rarer i fe a| ya 
aria," t/h Kpt/h “Kpulh ft 2 a5 Kpt/h “Kptlh Vit 
or, since f; 2 Na; = Qj, 
2a; 1 
Var{a!?} ~ ——— 1 -f,(1 - + = vgi. 4 
ar{a;"} Kpt/n fil — p)} Kv (46) 


APPENDIX B 
Minimum Variance Estimate 


In this appendix, we show that Estimate 3 can be obtained as an ap- 
proximation to a minimum variance linear estimate. Thus, from eq. 


(13) 
6) = 4 + wAT — A}. (47) 
This estimate is unbiased, i.e., E{a!?} = E{a!?} = a;, and has variance 
Var{a!} = Var{a!?} + 2w;Cov{a!), T — A} + w2Var{T — A}. (48) 

The value of w; which minimizes the variance satisfies the equation 
0 Var{a{} 





= 0, 49 
Ow; (49) 
which implies that 
Covia!?, A - T} 
(== 50 
eS Var? — A} oo 
From Appendix A, it follows that | 
wi = ai (51) 
N 
Py a 


Pex 
Now if a; is estimated by 4!” so that w; is estimated by d{/A, eq. (47) 
becomes 





— (52) 


Q.E.D. 
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Buffering of Slow Terminals 


By A. G. FRASER, B. GOPINATH, and J. A. MORRISON 
(Manuscript received January 23, 1978) 


In a previous paper, a model for queuing processes with correlated 
inputs was analyzed. To illustrate the application of those results, we 
model a concentrator of slow terminals. The sources of data, the ter- 
minals, generate data at a slower rate than the output speed of the 
buffer. In special cases, we obtain closed-form expressions for the 
generating function of the equilibrium queue size distribution. For the 
general case, we describe a computational procedure to obtain the 
distribution and the average of queue size in steady state. The nu- 
merical results obtained using this procedure are presented for a family 
of problems in which each message consists of two packets separated 
by a fixed time interval. 


1. INTRODUCTION 


A data communications network may be constructed by connecting 
together terminals and switching nodes so that each terminal is con- 
nected to just one node and the nodes are connected together in a 
more-or-less redundant fashion. All connections are by means of 
transmission lines. Those between terminals and nodes are called access 
lines, while those between one node and another are called trunk lines 
(Fig. 1). For economy, the access lines commonly have smaller bandwidth 
than the trunk lines. 

The character of the traffic carried by a data network depends in part 
upon the type of terminal connected to it. Keyboard and display ter- 
minals transmit and receive data messages that are typically less than 
a few hundred characters in length. These terminals operate at speeds 
up to 1.2 Kb/s. Batch stations transmit and receive data in larger quan- 
tities and typically operate at speeds up to 9.6 Kb/s. Trunk lines, and 
the connections to computers, typically operate at about 50 Kb/s. Thus, 
we find that traffic in a data network is not uniformly distributed either 
among the terminals or among the various transmission lines. 

Analysis of delay and the probability of queue overflow is most simply 
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Cr) TERMINAL 
(w) SWITCHING NODE 


—— LOW-—SPEED ACCESS LINE 


== HIGH-SPEED TRUNK LINE 


Fig. 1—Network topology. 


obtained by assuming that all traffic is uniformly distributed between 
the terminals and that all transmission lines operate at the same speed. 
Using these assumptions, Chu! has studied two cases. In one case, equal 
size packets (individual characters perhaps) are generated randomly by 
the terminals. In another case, messages are generated at random, but 
each message consists of a random number of packets which all enter 
the network in one instant. The case of mixed input traffic was studied 
by Chu and Liang.? 

Consider the situation when terminals randomly generate messages 
consisting of several fixed-size packets. The data are fed into a switching 
node over access lines that are substantially slower than the trunk lines 
used to carry data out of the node. Several packets may be transmitted 
on a trunk line in the time that it takes to transmit one packet on an 
access line (Fig. 2). Thus, packets of one message which are transmitted 
consecutively by a terminal will arrive periodically at the node, and the 
period will be greater than the period of packet transmissions on the 


2—PACKET MESSAGE 
FROM TERMINALA \\ 


} 


7 
A 
PACKETS ON TRUNK LINE 
\ 
i 
FIFO / 
ec ce sa age 
/ 


ee Se EQUAL TIME INTERVALS-——~ 
Fig. 2—Example of packet flow at a node. 
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trunk line. Given that packet arrivals are not entirely random, one would 
expect the buffer storage requirements in the switch to be less than is 
indicated by Chu and Liang’s analyses. 

In this paper, we study the behavior of a switching node that receives 
data from a (large) number of low-speed access lines. The data are re- 
ceived in the form of packets of a fixed size. As the packets arrive, they 
are placed in a buffer, which is a first-in-first-out queue. In an actual 
communications network, the buffer has a finite size, and a packet is lost 
if the buffer is full when it attempts to enter it. The buffer transmits 
packets at a uniform rate onto a high-speed trunk, provided the buffer 
is not empty. A crucial question is how large the buffer should be in order 
that the probability of packet loss should be less than 10-4, say. In the 
mathematical analysis of the single queue corresponding to this buffer, 
we consider a buffer of unlimited size so that no overflow is possible, and 
we calculate the steady-state probability that the buffer content (i.e., 
the number of packets in the buffer, or queue size) exceeds the proposed 
size of the finite buffer. We refer to this quantity as the probability of 
overflow, since it is usually used to estimate the actual overflow proba- 
bility, when the probability of packet loss is very small. 

Some of the symbols used are listed in Section IT. The mathematical 
model, which includes assumptions concerning the packet arrivals, is 
discussed in Section III. This model was analyzed by two of the authors,* 
and formulas for calculating the equilibrium queue size distribution are 
summarized in Appendix A. These formulas involve some marginal 
distributions, and some polynomials which have to be determined. Ex- 
plicit analytical expressions for the coefficients in these polynomials are 
given for some particular examples in Section IV. The computation of 
the coefficients for more general examples is discussed in Section V. The 
computation of the marginal distributions is discussed in Section VI. 
Some numerical results are presented in Section VII. 


ll. NOTATION 
bn buffer content at time n 
2a number of packets entering buffer in time interval 
(n,n + 1] 
k+1 number of time intervals required for message arrival 
a nonnegative integers with aj, > 0 
(x1, ...,x4) independent identically distributed vector of 


nonnegative integer valued random variables. 
ill. MATHEMATICAL MODEL 
The behavior of a switching node is modeled by considering the state 
of the buffer at discrete times. Suppose that the buffer transmits one 


packet in a unit time interval. If b,, denotes the buffer content at time 
n, the buffer content at time n + 1 is 
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bn4i1=b,-1+2, ifb, 21 
=z, ifb, =0 
or equivalently 
bn4i = (bn — 1)* + 2n, (1) 


where z,, is the number of packets entering the buffer in the time interval 
(n,n + 1]. The peculiar feature of our queuing problem arises from the 
fact that the access lines are slower than the trunk line on which the 
output of the buffer is transmitted. For example, the 9.6 Kb/s access lines 
operate approximately at only one-fifth of the speed of a 50 Kb/s trunk. 
Then, in our discrete model, messages arriving from the access lines will 
consist of a random number of packets separated by five units of 
time. 

Let us first focus our attention on the case when there are two packets 
to each message. If the packets are separated by d units of time, then the 
number of packets entering the buffer in the interval (n,n + 1] will be 
equal to the number of first packets in messages arriving in that interval, 
plus the number of second packets in messages whose first packets ar- 
rived d intervals earlier. Then z,, = x, + X,n—g, where x, is the number 
of first packets arriving in the interval (n,n + 1]. Now consider the case 
in which the messages are transmitted by N independent sources. The 
probability is zero that the first packet of a message from a source enters 
the buffer in a unit time interval if the first packet of a message from that 
source entered the buffer in one of the previous (2d — 1) time intervals, 
and otherwise it is p. Then 


d . : N-Iy\ . 
Prix, = toltn=j =1;, J= 1. ee } = ( ; )pio(1 a p)N-I-io, 
: lo 


where I = Dey ‘ij. Thus, x, and x,-g are not independent random 
variables. However, if N — © with Np = fixed, then Prix, = io} > 
(e—X!0)/ig!, independently of x,-;, j = 1,2,.... Hence, if the number 
of sources is large, it is reasonable to assume that the random variables 
Xn, n=0,1,..., are independently and identically distributed, and we 
will not restrict ourselves to the Poisson distribution. 

Next we consider the case where each message consists of either two 
packets, separated by d units of time, or of only one packet. If we let x} 
denote the number of first packets of two-packet messages arriving in 
the interval (n,n + 1], and x2 denote the number of single packet mes- 
sages arriving in the same interval, then z,, = x} + xj_4+ x2. In order 
to allow for randomness in the number of packets to a message (either 
one or two), x} and x2 may be dependent on each other. For example, 
if the number of packets in a message is two with fixed probability 
1 — p, and one with probability p, where 0 < p < 1, then E(t?""t3"’) = 
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O[(1 — p)t1 + pte], where O(t) is the distribution for the number of 
messages arriving in a unit time interval. As before, when there is a large 
number of independent sources, it is reasonable to assume that (x},x2), 

= 0,1,..., are independent identically distributed vector random 
variables. 

There are some obvious generalizations of the above special arrival 
processes. For instance, the random number of packets to a message may 
be as large as m 2 2. Also, there may be slow access lines with several 
different speeds, so that the packets in a message may be separated by 
d; units of time, i = 1,...,r. This led two of the authors‘ to consider 
arrival processes of the form 


=> > ae (2) 
i=1lj= 
where the vector nonnegative sales valued random variables 
{(x},...,x4)} are independent and identically distributed, and aj are 


riontiegative integers with aj > 0. In Appendix A we summarize the 
relevant results pertaining to the steady-state distribution of the queue 
size corresponding to (1) subject to (2). 


IV. EXPLICIT EXAMPLES 


To calculate the generating function of the steady-state queue size 
from (37) and (38), it is necessary to know the polynomials c,(s), r = 
1,...,R, as well as the quantities ¢,,(s), r= 0,...,k. Hence, from (39), 
we need to know the constants c;. We now give explicit analytical results 
for some particular examples. 

We first consider the arrival process 


Zn =xitxi_,ptx2t x3, (3) 
where k 2 2, and 
1 ‘ 
E(te*es*ta") = O[(1 — p)ty + pte] (ts), (4) 


with 0 < p < 1 fixed. This corresponds to arrivals from two different 
classes of sources. One class of sources sends messages which consist 
either of two packets separated by Rk units of time with probability 
1 — p, or of one packet with probability p. The other class of sources sends 
messages which consist of just one packet. From (2), (3), and (4), and the 
definition of v,, in (31), it follows that 


dory(s) = O(s) W(s), r=0,...,R—-1, 
brv(s) = O[(1 — p)s? + ps] V(s). (5) 


It remains to give the values of cj, which we do for k = 2 and 3, with 0 < 
p <1, and for k = 4 with p = 0 and W(s) = 1. It is shown in Appendix C 
how to determine which constants c; occur in (46). It is also shown how 
the values of c; were calculated in the case k = 3. 
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Let 
O(s) = ¥ pists W(s)= X qist. (6) 
i=0 i=0 
For example, the case p; = e~*A‘/i! corresponds to a Poisson distribution 


for the number of messages arriving from the first class of sources in a 
unit time interval. For k = 2 the nonzero coefficients cj are 


C005 C11 = (1 — p)p1g0Co0, (7) 
with coo + C11 = 1. For k = 3 the nonzero coefficients c; are 
C000, €o11 = (1 — p)P1G0C 000, 
C122 = (1 — p)?p?qicooo, C229 = (1 — p)*pop2gécoo0, (8) 
and 
_ (1= p)golpi(1 + pogi) + 2epope2qol 
n= FE 0 (9) 
[1 — (1 — p)popiqé] 
with 2cej = 1. 


For k = 4 we take p = Oand W(s) =1, corresponding to z, = Xx, + Xn—4, 
with E(s*=) = @(s). In this case, there are 14 nonzero coefficients cj, 
namely 


C0000 = Co; Co011 = P1C0; Co122 = DiCo, 
C1233 = P}Co, C0222 = PoP2Co, C3333 = P6P3Co, 
C1333 = C2333 = C2233 = PoP1P2Co, (10) 
and 
Co111 = PiAco, C1122 = pi Aco, 
C1922 = pi(1 + popi)Aco, —-€2222 = Pop2(1 + Pop) Aco, 
C1111 = Pill + po(pi + Pop2)(1 + pop1)] Aco, (11) 
where 
A = {1 — popill + pa(pi + pop2)(1 + popr)]}-t. (12) 
We now consider a class of arrival processes z,, for which the polyno- 
mials c,(s), r= 1,...,k, are, in fact, constants. Then the first two mo- 


ments of the equilibrium queue size distribution, Eyo and E(yé), may 
be expressed, with the help of (59) to (61) and (63), in terms of the first 
three moments of u,,, since c’,(1) = 0 and c”,(1) = 0. The class of arrival 
processes we consider corresponds to 


ai > 0, j= Ope. 5 Ji b= Tycetoh, 


a’ = 0, otherwise, (13) 
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in (2). We will show that yo, = 0 implies y,, = 0,r=1,...,, and hence 
that 
$(0,s1,... Sk) = w= 1 — Evpn. (14) 
This implies, from (39), that c,(s) = n,r=1,...,R. 
Now, since by, = Yon, it follows from (29) and (18) that yo, = 0 implies 
that 
x},_j-1 = 0, J=0,... Jz L= \ifesrarer (15) 
lip > J;, then al = Oforj 2r4+1. If, on the other hand, r +1< Jis 
then x;,-j+, = Oforr +1<j <jj, and aj = 0 forj > j;. Hence, ajx),-j+, 
= Oforj=rt+1,...,k,i=1,...,,r=0,...,k —1. Thus, from (380), yon 
= 0 implies y,, = 0,r = 1,...,k, as was asserted. 
As a particular example, we consider messages which consist either 
of one packet or of two or more packets (up to a maximum number) 
which arrive in consecutive time intervals. Such is the case when the 


access lines have the same speed as the output line. For this example, 
the arrival process is 


Ghose (16) 


Hence, from (2),/ =k + 1and 

pd J SOs Lea Lay rl 
ae ee eee es ee 
so that (13) is satisfied, and c,(s) = n,r=1,...,k. From (81) it follows 
that 


(17) 


wi=min(iir+1), r=0,...,k, i=1,...,.kR+1 (18) 


and 


k+1 
Von = do Xn 
1=1 


oe k+1 : 
De I Dit om 0 ic ol a (19) 
t=1 i=rt+1 


Let pj 2 0,i = 1,...,k + 1, be the probability that there are i packets 
in a message, where D+} p; = 1. Then, 


1 k+1 k+1 
B(ti ... tH) = © (> pits) (20) 
Hence, from (19), 
fov(s) = O(s), 


 k#1 
bro(s) = © (X pisi + yas) eee (21) 
i=1 i=r+1 
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V. COMPUTATION OF c,(s) 


As described in Appendix A [see (34)], the constants {c ;} determine 
the polynomials c,(s) from (39). In principle, using a program like AL- 
TRAN,” which performs symbolic manipulations, it is possible to sub- 
stitute (34) into (33) and equate coefficients of like powers on both sides 
of (33) to get the equations for the {c ;}. However, we will describe an al- 
ternate method that was used for obtaining the numerical results pre- 
sented in Section VII. From (36) it is seen that 


oan Pale aS Fee Te (22) 


The method presented here arrives at equations for {Po;} directly from 
the equations for {P;}: from (32) 


| De Pio-j1i1—i2,... ik inPj 
Jo=0,jeA 


+ OD Din-jr+1it—jott, ... ix—jet Pj (23) 
Jo>0,jeA 
The sequence of programs that were used to determine Pj, ... ;, are 
described briefly below. 

(i) The first step is to generate a sufficient number of equations from 
(23). Starting with i = (0,0,...,0), the right-hand side of (23) is found 
in symbolic form. Using a test for determining admissible states, every 
index j appearing on the right-hand side of (23) that corresponds to states 
not communicating with (0,0, .. . ,0) is omitted. Corresponding to each 
new index that arises on the right-hand side of (23), a new equation is 
generated from (23) by setting i equal to the new index. This process is 
terminated when every new index generated has ig 2 k. This whole 
process is repeated starting with each index with io = 0, 1.e., for each 
Poj,... j,. The total number of equations and the total number of un- 
knowns are counted. The output of this program is this set of equations 
in symbolic form. The number of equations turns out to be always less 
than the number of unknowns. However, we know‘ a set of linear ho- 
mogeneous equations must exist for Po;, .. . ;,. In the examples consid- 
ered, visual inspection revealed a few substitutions which made the 
number of equations one less than the number of unknowns. The details 
of aspecific procedure for accomplishing this, in the case / = 1 in (2), have 
recently been given by Massey and Morrison.® 

(it) The normalizing constant which determines Pj, .. . ;, uniquely 
is easily seen to be (u/ZPoj, . .. j,). This program starts with one of the 
unknowns found in step (i) above and sets its value equal to 1. Then it 
determines the maximum number of other unknowns that can be de- 
termined from this recursively, i.e., without having to invert any matrix. 
The best unknown to fix, the one that minimizes the number of un- 
knowns to be solved by inverting a matrix, is selected and set equal 
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to 1. The matrix corresponding to the equations for the other unknowns, 
and the right-hand sides for these equations, are then generated in 
symbolic form. The output of this program is then a subroutine which 
generates the coefficients of c,(s) in symbolic form, as functions of the 
given probabilities. Some excerpts are shown in Tables I and II. In Table 
I is the “triangular” part of the equations for k = 5. R;,._.. ;; here is 
Pio... is normalized such that Poooooo = 1 and P(t) = Pr{x, =i — 1}. In 
Table II are equations for generating the coefficients of c,(s) from 
Rig 2. U5 

Vi. COMPUTATION OF MARGINAL DISTRIBUTIONS 


Equations (37) and (38) give expressions for the generating functions 
of the equilibrium distributions of yjn, 7 = 0,1,...,k. As in (80), yon is 
b,, the queue length. Hence, the equilibrium queue size distribution has 
as its generating function ¢0(s). If wo; = limnt. Pr{b, = Jj}, 7 = 0,1,..., 
then $o(s) = Z79 70js/. One way to find {7o;} is to start with ¢; (s) and 
iterate using (38), thus obtaining an expression for ¢o(s), then to inverse 
transform ¢o(s). For example, using s = e~/”, we can treat ¢o(s) asa 
function of w, and then finding {79;} corresponds to finding the Fourier 
series for do(e 77”). 

We will present a different method here. Generally, the quantities of 
interest are 

Io; = ¥ 70; = lim Prib» <j} fori <N, 
1=0 nfo 


Table | 


Ry = ((1—P(1))/P(1))*R ooo000 Ro20356 = (P(2)/P(1))*Riti244 


Ryii12 = (P(2)/P(1))*Rooooo0 
Roo200g = (P(2)/P(1))* Rizzi 
Ro29904 = (P(3)/P(2))* Riri 
R333335 = (P(3)/P(2))*R 209993 
R333336 = (P(4)/P(3))*R220004 
R 444447 = (P(4)/P(8))*R333335 
R 444448 = (P(5)/P(4))*Rassa36 
oooo11 = P(1)* Riri 
Ryi1123 = (P(2)/P(1))*Rooo011 
R111133 = P(1)*Ro20004 
Ro999935 = (P(3)/P(2))*R 111123 
Ro92045 = (P(2)/P(1))*Ri11133 
R220955 = P(1)*R333336 
R333347 = (P(4)/P(3))*R2292035 
333357 = (P(3)/P(2))*R 222045 
R333367 = (P(2)/P(1))*R 20055 
333377 = P(1)* A 444448 
Rooo122 = P(1)*R111193 
Rooozee = P(1)*Riiiiss 
Ritiesa = (P(2)/P(1))* Roooree 
R111334 = (P(2)/P(1))*Rooo222 
111244 = P(1)*Ro292035 


111344 = P(1)*Ro22045 
Riii444 = P(1)*Ro29955 
Ro20346 = (P(3)/P(2))* Risses4 


Roooaag = (P(3)/P(2))*Ririss4 
Roveas6 = (P(2)/P(1))* Ririsa4 
Ro20556 = (P(2)/P(1))*Riri444 
R220366 = P(1)*R333347 
Rose466 = P(1)*R 333357 
Ro20566 = P(1)*R333367 
Re20666 = P(1)*R333377 
Roo1233 = P(1)*Ri11234 
Roois33 = P(1)*Riri24a 
Roo02233 = P(1)*R111334 
Roo2s33 = P(1)*Ri11344 
Roo3333 = P(1)*Rii1444 
Rijo345 = (P(2)/P(1))*Roo1233 
Ry12445 = (P(2)/P(1))*Roo1333 
Ry13345 = (P(2)/P(1))* Roo2233 
Ri13445 = (P(2)/P(1))* Rooe3a3 
Ry14445 = (P(2)/P(1))*Roosa33 
Ry412355 = P(1)*Re22346 
Ry10455 = P(1)* R222356 
R112555 = P(1)*Ro20366 
R413355 = P(1)*Roe2446 

113455 = P(1)*Ro20456 

113555 = P(1)*Ro20466 
R114455 = P(1)* Roo0556 
R414555 = P(1)* Ro22566 
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Table Il 


C(1,1) = Rooooos + Roooo11 + Rooo111 + Rooo122 + Rooo222 + Roo1111 + Roo1122 

C(1,1) = C(1,1) + Roo1222 + Roo1233 + Rooi333 + Roo2222 + Roo2233 + Rooz333 + Roosaa3 
C(1,2) = Roii11 + Rors122 + Ro1i222 + Ro11233 + Ro11333 + Ro12222 + Ro12233 

C(1,2) = C(1,2) + Roiess3 + Roies44 + Roresss + Roiz333 + Roisz44 + Roigssa + Rorssaa 
C(1,3) = Rov2202 + Ro22033 + Roz2333 + Ro22344 + Roze444 + Ro23333 + Rov3344 

C(1,3) = C(1,3) + Roosaaa + Roosaaa 

C(1,4) = Rossaaa + Ros3344 + Ross444 + Rosa4aa 

C(1,5) = Rosa4aaa 

C(2,1) = Roooooo + Roooo11 + Rooo111 + Rooo1z2 + Rooo222 


C(2,2) = Roor111 + Roo1122 + Rooi222 + Roo1233 + Rooisa3 + Ro11111 + Ror1122 

C(2,2) = C(2,2) + Roi1202 + Roiie33 + Ro1is33 

C(2,3) = Roog202 + Roove33 + Rooes33 + Roiz222 + Ro12233 + Roi2333 + Ro12344 

C(2,3) = C(2,3) + Roresaa + Ro22202 + Ro22033 + Rov2333 + Rove344 + Rov2444 

C(2,4) = Rooss33 + Ro1sss3 + Ro13344 + Rors444 + Roosa33 + Roo3344 + Roosasa 

C(2,4) = C(2,4) + Rosssa3 + Rosasaa + Rossaa4 

C(2,5) = Roiasaa + Rooaaas + Rosgsaaa + Rosas 

C(3,1) = Roooooo + Roooo11 

C(3,2) = Roooi11 + Rooo1e2 + Rooo1111 + Roo1122 + Rori111 + Ro11122 

C(3,3) = Rooozee + Rooi222 + Roo1233 + Roo2222 + Roo2233 + Ro11222 + Ro11233 

C(3,3) = C(3,3) + Ro12202 + Ro12233 + Ro22222 + Ro222033 

C(3,4) = Roo1333 + Roov333 + Rooss33 + Ro11333 + Ro12333 + Roi2344 + Ro1ss33 

C(3,4) = C(3,4) + Roiss44 + Rove333 + Rozes44 + Ro23333 + Rov3344 + Rossa33 + Rossa44 
C(3,5) = Ro1esa4 + Roisaas + Roiagaa + Rov2a44 + Roosaas + Rovsaaa + Rosaaaa 

C(3,5) = C(3,5) + Rosaaaa + Rosaaaa 

C(4,1) = Roooooo 

C(4,2) = Roooo11 + Rooo111 + Roo + Ror1111 

C(4,3) = Rooo122 + Roooz22 + Roo1122 + Roo1222 + Rooze22 + Ro11122 + Ro11222 

C(4,3) = C(4,3) + Ro12222 + Rov2229 

C(4,4) = Rooies3 + Pootsss + Roo2233 + Roo2333 + Roos333 + Ro11233 + Ro11333 

C(4,4) = C(4,4) + Ro12233 + Ro12333 + Ro13333 + Ro22233 + Ro22333 + Ro23333 + Ross3a3 
C(4,5) = Rows Ro12444 + Rois344 + Roisa4a + Roisaaa + Rov0344 + Rozesaa 

C(4,5) = C(4,5) + Roossa4 + Rozsaaa + Roosaaa + Rozszaza4 + Rosaaa4 + Rozaaag + Rosaaas 


where N is some initially selected constant. The method presented here 
determines IIo;, 7 < N, uel in a finite number of additions and 
multiplications. Let ¢,(s) = Zj=o x,js/ and II; = Yio 7,7. We can write 
(37) as 


(1 — s) pv (s) - cx (s) 

br (s) OSs (24) 
Denote ¢;(s)/(1 — s) by ¥z(s), so that Y,(s) = 22 yjs/ for |s| < 1. 
Therefore, 
cr(s) = bev(s) 
Wr(s) Glas? 
Similarly defining y,(s) = Do Ijs/ for r = k — 1,k — 2,...,0, we 
have 


|s| <1. (25) 


Y(s) = s—[¥,+1(s) _ ¢r+1(8)]@rv(s). (26) 


The functions $7, (s) are, of course, determined from the distributions 
of x;, and the constants ai. 

For each r, the process of determining I1,; from II,+1,; can be described 
as follows. Subtract the known constants c,+1,; from II-+1,; for 7 < degree 
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of c,41(s). Since I,+1,0 = ¢r+1,0, the sequence 6,; corresponding to 
s—1[W,+1(s) — ¢-+1(s)] is such that 6,; = 0 for j < 0 and 


nee Tl +1,j+1 — ¢r+1j+1 for 1 <j + 1 < degree of c,+1(s) 

Il-+1,j+1 for j + 1 > degree of c,+1(s) 
In order to get {II,;}, we now convolve this sequence {65,;} with the se- 
quence corresponding to ¢,,(s), say {p;;}. Therefore, I; = Dleo 6, ;—-iDri- 
This process involves only a finite number of multiplications and addi- 
tions as long as only a finite number of II,;’s are sought. Notice that in 
order to determine IIo;, 7 < N, we have to start with values of T;; for J 
<N+R. 

We will now show that there is a recursion which allows us to compute 
Iizj,] < N +k, ina finite number of arithmetic operations. Let II’,; 
correspond to $zv(s)/(dxv(s) — s) and let dgy(s) = Ufo Pajs. Then, 
equating coefficients of like powers of s on both sides of 


= : prv(s) 
ys) 2 (27) 
=o” bv (8) — 
we can derive the following. 
Il’zo = 1, 


J : 
W’,j = (ppj + W’n,j-1 - > Prill’ej-i)/Pro, J =1,2,.... (28) 
= 


Once the II’;,; have been determined, we convolve the sequence {I1’;;} 
with {cz;} to get {I1z;}, as seen from (25). Summarizing, we have shown 
that Io; for 7 < N can be determined from c,(s),r = 1,...,k and ¢,y(s), 
r=0,1,...,k, by performing only a finite number of multiplications and 
additions. The method described was used in calculating the probabil- 
ities presented in Section VII. 


Vil. AN EXAMPLE OF THE CALCULATIONS 


We calculated the queue size distributions for various traffic inten- 
sities £z,, for the following queuing models: . 


(i) bn4+1 = (bn — 1)+ + 2x,, R=0 

(it) bn+1 = (b, = 1)t + Xn oe Xn-3) k=83 
(tii) bn41 = (bn, — Lt t+xntxn-4 R=4 
(iv) bn41 = (6, —1)* +Xn4+Xn-5, R= 5, 


where the i.i.d. random variables x,, are assumed to be distributed ac- 
cording to the Poisson law. These cases are referred to equivalently by 
referring to the value of k. The objective was to determine what buffer 
size would suffice for a concentrator used to buffer terminals that gen- 
erate two packets per message but are slower than the trunk line. We 
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present here some of the results. In Fig. 3 the abscissa corresponds to 
traffic intensity: average number of packets transmitted on the trunk 
line per unit of time. In the steady state, the probability that the queue 
size exceeds that shown on the ordinate is less than 10~4 for each value 
of traffic intensity. The cases when all packets arrive at once (k = 0) or 
“infinitely apart” (k t ©) are shown, as well as the case k = 5. Whenk 
= 0, the queue size corresponding to each value of traffic intensity is 
either twice that of the case k = —, or one less than twice.* These two 
cases are further compared in Fig. 4, this time for buffer sizes 20 and 40, 
and the logarithm to base 10 of the probability of the queue size ex- 
ceeding 20 and 40 is plotted. 

For fixed traffic intensities, the change in the probability of overflow 
as a function of buffer size can be seen from Figs. 5, 6, and 7. From these 
figures we can see that for low traffic intensities and low buffer sizes there 
is a difference between the batch case k = 0, and k = 5 (see Fig. 5), but 
for larger traffic intensities the difference decreases substantially. From 
the formulas (37) and (38), it can be shown that the tail of queue size 
distribution is geometric for each value of k. Furthermore, for all finite 
values of k, the common ratio is the same as that of the case when k = 
0, so that the similarity in the behavior of queue size distributions for 
large values of queue size is as expected. The slopes of the curves marked 
k = 5 in Figs. 5, 6, and 7 approach those of the curves marked k = 0 for 
large values of the abscissa. Hence, in applications where probabilities 
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Fig. 3—Buffer size vs traffic intensity for k = 0, 5 and ~, and probability of overflow 
< 1074. 


2876 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


K BUFFER 
“\ 77 SIZE 


-_—_—— 


_ 
(0,20) 

(0,40) ~— — 
(c0,20) -—~ 


(20,40) _ 


LOG,, PROBABILITY OF OVERFLOW 





0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
TRAFFIC INTENSITY 


ee 4—Probability of overflow vs traffic intensity for k = 0 and ~, and buffer sizes 20 
and 40. 


of overflow of 10~7 or smaller are required, the relative slowness of the 
sources of packets does not seem to reduce the buffer size required. 
Packets may be assumed to arrive simultaneously for the purpose of 
estimating the buffer size. 


APPENDIX A 
Summary of Formulas 


We here summarize formulas for calculating the equilibrium queue 
size distribution. With a model for the input process 2, as in (2), the 
queue size b,, is described by 


L Rk ae 
bn+1 = (b, = 1)t i 2 xX Anes (29) 
1=1j= 
Various formulas pertaining to (29) were derived.* The reader is referred 
to Ref. 4 for proofs of the formulas presented here. Define yo, = b, and, 
forr =0,1,...,k —1, 


L ok eee 
Yrtin =Ynt DV O5X n—j+r- (30) 
i=l jert1 
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TRAFFIC INTENSITY = 0.4 
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Fig. 5—Probability of overflow vs buffer size for k = 0, 5 and &, and traffic intensity 
0.4, 


Note that yon < Yin <... <n. The vector process (Yon,Vins - - - -Vkn)> 
n= 0,1,2,..., is Markov under the assumption that (x},x2,...,x},),n 
= 0,1,2,...,is a vector sequence of independent identically distributed 
random variables. The Markov chain, denoted by S, which corresponds 
to (Yon,Vin» -- -»Vkn), was shown to be positive recurrent when Ez, < 
1. The states of S are those which communicate with (0,0, . . . ,0), since 
it is assumed that the buffer is empty and no one is transmitting initially, 
atn = 0. Let 


r ; 5 at ap i 
Saja ah Vwi = vm r= Oly. k, (31) 


The probabilities that enter the calculations turn out to be only those 
corresponding to the random variables u,,, r = 0,1,...,k. Let Digi; .. . ix 
= Pri{von = io, Vin = i1,.. - kn = iy}. Then the transition probabilities 
for S are given by: 


n+1 — eR aie, Se .  . pn 
PR = 2 Dio-yirmia,.. . tkinP§ 
Jo=0,jeA 


+ ; ~ Pio—jit+1,i1—jot+l, ... inmjeeil j: (32) 
Jo>0,jeA 
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TRAFFIC INTENSITY = 0.6 
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Fig. 6—Probability of overflow vs buffer size for k = 0, 5 and ~, and traffic intensity 
0.6. 


The sums in (32) are over those indices Jo,/1, . . . Jz Which correspond 
to states communicating with (0,0, ... ,0) denoted by A. As mentioned 
above, when Ez, < 1, then lim, P? = Pj exists and Pj satisfies (32) with 
P? and P}'*! both replaced by P;. The generating function corresponding 


to P;, 


$(So,S1,.-- Sk) = D Piss... si? 
icA 
satisfies 


Rk 
(s0,81, see Sk) a [6(1,50,51, coe »Sk—2,Sk—18k) I] ect 
i=0 


h 
+ (1 = re 7) P(0,80,81, - - » Sk-2,5k-18k) bv (S081 - - - 8k), 
= 


where 


k 
hy (S0,81,... Sk) = E TT sPir. 
i=0 
It can be shown‘ that ¢ has the representation 


(0,81, -- - Sk) = 22 c;0j(So, .- . 8x), 
J 
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(33) 


(34) 
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Fig. 7—Probability of overflow vs buffer size for k = 0, 5 and ~, and traffic intensity 
0.8. 


where 6; (so, . . . 52) are solutions of 


k 
6; (0,81, ey'e0¢ Sk) = [6;(1,s0,s1, ce Sk—2Sk—1Sk) II se 
i=0 


k a7) . . 
+ w(1 — TI 57")sd's}?.. . sHEX(sp—-18k)*#] 60 (S081, --- 8k), (35) 
i=0 


with p = 1— Ez, = 1 — Evzn. Substitution of (34) into (33) yields a sys- 
tem of equations for cj; which together with 2c; = 1 uniquely deter- 
mine 


~(0,81,---,Sk) = bE zs c;st! ... Sth, (36) 
j 


Once (0,51, ...,8%) is found, the generating functions corresponding 
to the marginal distributions, namely lim,;. Es» = ¢;(s), are solutions 
of 


(1 = sNex(s)brv(s) 


iO= a (37) 
where ¢j,(s) = Es¥n, 7 = 0,1,...,k, and, forr =k —1,...,1,0, 
obr(s) = [s~1b-41(s) + (1 — s~)er41(s)]brv(s). (38) 
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Here the polynomials c,(s), r= 1,...,k are of finite degree and are given 
by 


r 
Cn(S) = POL ce8) Loewe Ll) HD Chijos. ieoncdes (39) 
J 


Then (37) uniquely determines ¢;(s). Using (38) k times yields ¢o(s), 
which by definition of yon [see (80)] is the generating function of 
steady-state queue size. 

An alternate representation of the generating function corresponding 
to P; is obtained by setting u; = Nh; si;,] = 0,...,k and defining 


$(so,81, tee Sk) = P(U0,U4, soe jp) (40) 
Then, corresponding to (83), 
B(Uo,U1, ose ,Up) = [ug b(uo,wo,W1, tee ,Up—1) 
7 (1 >< uo +) &(0,uw0,t1, sae Up—1) | Py (Uo,U4, toe Up), (41) 


where 
k l ee 
by (Uo, ... Ue) = E ( Il ur), Win= Laixi, (42) 
r=0 i=1 
It follows from (41) that, for 7 =0,1,...,k — 2, (k = 2), 


®(S, 645 58,Ua, s+» pay) = [s“18(s,.. «58,01; 0. sepa) 
+ (1 —s71)@(0,s,...,8,U4,... ipsi=i)| 
X B,(S,...,8,U1,...,Ur—j), (48) 


and 


P(s,... ,8,Uz) = [s—10(s, ... ,s) 
+ (1 —s—!)(0,s,...,s)]@,(s,...,8,U1). (44) 


If we set wu; = s in (44), and solve for ®(s,...,s), we obtain 
_ A —s)&(0,s, ... ,s)P,(s, ... ,8) 
[b,(s,...,8) —s| 


It was shown‘ that 0(0,u1,...,u,) is a multinomial independent of 
Up. From (48) to (45), ®(s,u1,...,Uz) may be expressed in terms of 
(0,5, ... ,8,U1,... Up—j-1),J = 0,...,k — 2, and B(0,s, ...,s). If we let 
s — 0 in this expression, and equate ®(0,u4, ... ,uz) with the finite part, 
we obtain a system of homogeneous linear equations for the coefficients 
in the multinomial. In general, we also obtain a (consistent) set of ho- 
mogeneous linear equations from finiteness conditions. The normali- 
zation condition is (0,1,...,1) = yw. From (86) and (40), it follows 
that 


®(s,...,8) (45) 


ae) ee 
(Oj; ese) Hal Ge fe Tk (46) 
j t=2 
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Also, from (39), 
r k-r 
cr(s) = €(0,s,...,8,1,...,1). (47) 


Formulas for calculating the first two moments of the equilibrium 
queue size distribution are derived in Appendix B. 


APPENDIX B 
First and Second Moments 
We here derive expressions for the first two moments of y, = limy--« 


y¥rn. We note that yo represents the equilibrium queue size. By defini- 
tion, 


or(s) = E(s>), frv(s) = E(s’™), (48) 
forr =0,...,k. Hence, 
ér(1)=1, $’7-(1) = Eyr, $”-(1) = E(y7) — Ey, (49) 


and 
Pro(1) = 1, 6’ w(1) = Emm, 6” (1) = E(Uin) — Ev. (50) 
We also note that, from (39), since 2 c; = 1, 
cye(1) = w= 1— Evpgn, r=1,...,. (51) 
From (37), 
(1 =8) 


-_ 52 
Pe oe 


bp(s) = cr(s)dur(s)f(s), f(s) = 
and, from (38), 
or(s) = [s~46-41(s) + (1 —sYersils)]orv(s), r=0,...,k —1. 
(53) 
If we differentiate (53), we obtain 


b’(s) = [—s—2bp41(s) + 8716’-4-1(s) + 8 ~2er4-1(s) + (L — se’, 41(s)] 
X drv(s) + [s~!r41(s) + (1 — s7)er4i(s)]b’ro(s). (54) 


If we set s = 1 in (54), and use (49) to (51), we obtain 
Ey, = Ey,41 + Evpn — Even, r=0,...,k — 1. (55) 
Next, if we differentiate (52), we find that 


6’ (s) = [c’n(s)bpu(s) + cr(s)b’kv(s)]f(s) + cn (s)bnv(s)f’(s). (56)? 
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But ; 
ry (s) = 1+ (s — 1)’x, (1) + 5 (s — 1)?6” pny (s) 


+ ; (s — 1)86"~(1) +... (57) 
Hence, from (52), 
ee (5 — 1)6"mo(1) 
[1 — @’x.(1)] — 2[1 — 6’e, (1)? 
—= 2 mM uw 9 
(s 1) w) kv (1) Io kv (1)] ass e (58) 


2 3[1-— ¢’po(1)]? = 2[1 — pn (1) 
If we let s — 1 in (56), and use (49) to (51) and (58), we obtain 
c’,(1) [E (ven) a Evpn| 
Ey, = = + Ev, + —— > 
oS Ge hye) he Ono) 


From (55), it follows that 
k-1 
Ey; = Eyp + Ds Ev, — (k — j)Evpn, J =O k= 1, (60) 
r=] 


(59) 


This determines Ey;, and in particular Eyo, in view of (59). 
For the second-order moments, if we differentiate (54) and set s = 1, 
we obtain 
E(y7) = E(yius) — Eyrt1 + Ey, + E(Uin) — Evrn 
+ 2(1 ae Ev;n) (Even = Ey,+1) 2a 2c’p+1(1), (61) 
forr =0,...,k — 1. Finally, if we differentiate (56) and let s — 1, and 
use the relationship 
ob” ny (1) = E(ugn) — 8E (vin) + 2E (pn), (62) 
which follows from (48), we obtain, after some simplification, 


E(y2) = Ey, 4 dete (1) + 2Bopnde’eQ)] [E (vin) — Evpn|e’r (1) 


(1 — Evin) Gi =o.) 
[E(uzn) i Evpn|\ [E(v7,) a Evpn|? 
aie ae ne 


An expression for E(y@) may be obtained from k applications of (61), with 
the help of (59), (60), and (63). 


APPENDIX C 
Determination of Constants 
We first show how to determine which constants c; occur in (46) for 

the example of (3). From (2) and (30), 

Yon = bn, Yr+ijn — Yrn = ee es 0, te sk Su (64) 
Hence, 

k 
P(uo,Uy,...,Ur) = lim EF (ube II uph-trt). (65) 


n—-o r=1 
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But from iteration of (29), it is evident that 
by =O Yang I= 1, l=1,...,k, (66) 
where Z,, is given by (2). Hence, for the example of (3), 
bn =O Yeh <I =1, [=1,...,k. (67) 
= 


The inequalities in (67) determine the admissible vectors (x}_,,... ,x}_1) 
corresponding to b,, = 0, and thus, from (65), which constants c; occur 
in (46). Note, in particular, that x}_, = 0 implies that 6(0,u1, ... ,wz) 
is independent of uz. 

For purposes of illustration, we show how to calculate the values of 
c; for the example of (3), subject to (4), in the case k = 3. From the above 
procedure it is found that 
(0,U1,U9,U3) = x(Uy,U2) = w(Cooo + C1111 + Core 

+ Coo9Ut + Ci29U1U2). (68) 
But from (2) and (42), 
@, (Ug,W1,U2,U3) = O[(1 — p)uqug + pug] ¥ (uo). (69) 
Hence, from (45), 
) (1 — s)x(s,s)O[(1 — p)s? + ps] ¥(s) 


&(s,5,5,8) = (O[(1 — p)s? + ps] ¥(s) —s} (70) 
Then, from (44), we obtain 
ecuae (1 —s)x(s,s)O[(1 — p)suy + ps|¥(s) (71) 


{O[(1 — p)s2 + ps] V(s) — s} 
Also, from (48), we have 
®(s,8,U1,U2) = [s—!@(s,s,s,u1) + (1 — s~})x(s,s)] 
X O[(1 — p)su2 + ps] W(s), (72) 
and 
P(s,U1,U2,U3) = [s~1B(s,s,u1,U2) + (1 — s~)x(s,u1)] 
X O[(1 — p)suz + ps]W(s). (73) 
From (71) to (73), it follows that 
(1 =s) 
s 





#(s,U1,U2,U3) = O[(1 — p)sug + ps] ¥(s) 


W(s) 





x jela — p)suz + ps] 


| O[(1 — p)sui + ps) V(s) 


{O|(1 — p)s2 + ps] ¥(s) — s} - | x(s,s) — x(su)}. (74) 
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From finiteness at s = 0 we deduce, with the help of (6), that 


x(0,u3) = w[1 + (1 — p)pigotrlcooo. (75) 
Hence, from (68), 


Cour = (1 — p)P1G0Coo0- (76) 
If we now let s — 0 in (74), and equate coefficients, we obtain, in addition 
to (76), the relations 

Ci2e = (1 — p)*pigécooo, Co22 = (1 — p)2popeqscooo, (77) 

and 
Ci = (1 — p)go(pi + PoP1q1 + 2pPoP2G0)Co00 

+ (1 — p)popigs(ci11 + Co11) — Pogoci2z2. (78) 
If we substitute the values of co11 and ¢499, from (76) and (77), into (78), 
we obtain (9). 
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Speaker Verification by Human Listeners over 
Several Speech Transmission Systems 


By C. A. MCGONEGAL, L. R. RABINER, and B. J. MCDERMOTT 
(Manuscript received January 6, 1978) 


Although a great deal has been learned about how speakers are 
verified, both by humans and by machines, several factors have not yet 
been studied. One of these factors is the effect of the transmission 
system (over which the message is communicated) on the accuracy with 
which verification is achieved. This factor is potentially an important 
one for digital communications problems over telephone lines where 
the transmission system could vary from one which gives a high-quality 
coded representation of the signal (e.g., log PCM) to a low-bit-rate vo- 
coder. The purpose of this paper is to demonstrate the effects of three 
speech transmission systems on verification accuracy by human lis- 
teners. It is shown that the false alarm rate (i.e., a customer ts rejected) 
is significantly higher when the test and reference utterances are 
transmitted by different systems than when transmitted by the same 
system. The miss rate (i.e., an imposter is accepted) 1s not significantly 
different for similar comparisons except for one of the conditions. The 
overall conclusion of this experiment is that speaker verification by 
human listeners cannot be performed as accurately over mixed speech 
transmission systems as over the same transmission system. 


l. INTRODUCTION 


Speaker verification, both automatically by machine and by human 
listeners, is an important problem in the area of man-machine commu- 
nication by voice.!-® The verification problem has applications in the 
business community for such things as voice banking by telephone, credit 
card transactions (including charging of telephone calls), and access of 
privileged or confidential information. 

As shown in Fig. 1, the speaker verification problem, either by human 
listeners or by machine, has two aspects—the creation of a reference 
pattern (i.e., the training phase) and the determination of similarity 
between a test and a reference pattern (i.e., the testing phase). When 
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Fig. 1—Block diagram of the speaker verification problem. 


verification is to be performed over telephone lines, an additional factor 
complicates the problem—namely, the transmission system used in the 
telephone plant. With the increased reliance on digital speech-processing 
techniques, such as waveform coders and linear prediction methods, the 
interesting possibility arises that the test pattern for speaker verification 
may have been coded or vocoded, whereas the reference pattern may not 
have been subjected to the same processing. Past experiments by 
Rosenberg? have studied the problem of verification both by human 
listeners and with an automatic system using natural speech for both 
the test and reference patterns. The purpose of this experiment is to 
evaluate how several speech transmission systems affect the process of 
speaker verification by human listeners. In future work, we will inves- 
tigate the parallel problem—how these same factors affect automatic 
methods of speaker verification. 

The organization of this paper is as follows. Section II describes the 
way in which the evaluation was carried out. This section includes a 
description of the speech transmission systems, as well as the experi- 
mental procedure used to measure system performance. In Section III, 
the experimental results are presented in terms of a signal detectability 
model, and in Section IV the results are discussed. 
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li. EXPERIMENTAL EVALUATION 


For the experiment to be described below, both the reference and test 
utterances were preprocessed by one of the following three transmission 
systems: 

(t) Bandpass filtering from 100 to 2600 Hz. 
(it) Adaptive differential pulse code modulation (ADPCM) coding, 
followed by bandpass filtering from 100 to 2600 Hz. 

(iit) Linear predictive vocoding (LPC), followed by bandpass filtering 

from 100 to 2600 Hz. 
The bandwidth of all three systems was set to 2500 Hz, in accordance 
with the requirements of the ADPCM coder, to ensure that the speech 
bandwidth was not a factor in determining relative verification ac- 
curacy. 

The ADPCM coder used in this experiment was a simulation of 
the coder built by Bates,? based on the work of Cummiskey et al.!° 
Figure 2 is a block diagram of the ADPCM system. The input signal is 
band-pass-filtered from 100 to 2600 Hz and sampled at a 6000-Hz rate. 
A 4-bit adaptive quantizer was used to code the difference signal, giving 
an overall bit rate of 24 kb/s for the coder. The step-size multiplier of 
the quantizer ranged over a 41-dB range (i.e., the ratio between the 
largest and smallest step size was 114 to 1). A first-order predictor was 
used with a multiplier of a = 0.9375. Signal levels were chosen so that 
the coder was operating at approximately the optimum point. 
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Fig. 2—Block diagram of an ADPCM coder. 
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A block diagram of the LPC vocoder is given in Fig. 3. The imple- 
mentation was based on the autocorrelation method of linear predic- 
tion.!!-18 Pitch detection and voiced-unvoiced decision were performed 
using the modified autocorrelation pitch detector of Dubnowski et al.!4 
The input signal was sampled at a 10-kHz rate, and a 12-pole LPC 
analysis was done with a pitch adaptive, variable frame size, at a rate of 
100 frames per second.!> No quantization of the LPC parameters was used 
in this experiment. 


2.1 Data base for the evaluation 


To evaluate the three transmission systems, a data base was designed 
which included 
(t) 16 speakers designated “customers”: 
8 male. 
8 female. 
(ii) 62 speakers designated “imposters”: 
31 male. 
31 female. 
(iii) 2 sentences: 
““‘We were away a year ago” —male utterance. 
“T know when my lawyer is due.” —female utterance. 
(iv) 3 versions of each utterance: 
bandpass filtered speech—SP. 
ADPCM coded and filtered speech—ADPCM. 
LPC vocoded and filtered speech—LPC. 
The set of male utterances used in this study were those used by 
Rosenberg in his earlier work.4 New recordings were made for the set of 
female utterances. Both male and female speakers recorded 10 utter- 
ances over a period of several weeks. The imposters provided just one 
recording each. 


2.2 Experimental procedures 


To test the effects on verification of combinations of different speech 
systems for the reference and test utterances, a paired-comparison test 
was used. A block diagram of the experimental arrangement used is 
shown in Fig. 4. Each test pair consisted of a comparison utterance and 
a challenge utterance. The comparison utterance was always a customer 
utterance processed by one of the three transmission systems. The 
challenge utterance was either an imposter utterance (customer-imposter 
pair) or one of the remaining nine utterances of the same customer 
(customer-customer pair) processed by one of the three systems. 

Ten analog tapes were prepared. Each tape consisted of only male or 
female utterances with 48 customer-customer and 48 customer-imposter 
pairs randomly presented. The eight customers were presented in each 
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Fig. 3—Block diagram of an LPC vocoder. 
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Fig. 4—Block diagram of the experimental arrangement. 


of the transmission system combinations in both types of paired com- 
parisons. When the transmission system combinations were heteroge- 
neous, four of the eight customers were presented in each of the two or- 
ders. 

Two tapes, one containing the male utterances and one containing the 
female utterances, were presented over headphones to five different 
groups of six naive subjects who were seated in a soundproof booth. The 
subjects were asked to indicate whether the comparison and challenge 
utterances were spoken by the same or by different speakers. They re- 
ceived no training for the experiment and were given no instructions as 
to the costs of any type of error. 


il. EXPERIMENTAL RESULTS 

The subject responses can be interpreted according to signal detection 
theory!6—i.e., as a hypothesis test. For any trial in the test, there were 
two input hypotheses (same speakers or different speakers) and two 
possible subject responses (SAME and DIFFERENT). Therefore, each trial 
can be represented by the intersection of one of the input alternatives 
and one of the response alternatives as indicated in Fig. 5. There are two 
types of errors (false alarm and miss) and two types of correct responses 
(hit and correct rejection) associated with each trial. A false alarm (the 
rejection of a customer) is defined as a subject response of DIFFERENT 


2892 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


RESPONSE 


“SAME” “DIFFERENT” 


FALSE 


a CORRECT ALARM 
nae REJECTION (CUSTOMER 


REJECTED) 


INPUT 


MI 
“DIFFERENT” deoeten 
ACCEPTED) 





Fig. 5—The various response classifications for detecting an imposter. 


when both utterances are spoken by the same speaker. A miss (the ac- 
ceptance of an imposter) is defined as a subject response of SAME when 
the challenge utterance was spoken by a different speaker. A hit (ac- 
ceptance of a customer) is defined as a subject response of DIFFERENT 
when the challenge utterance was spoken by a different speaker. A cor- 
rect rejection (rejection of an imposter) is defined as SAME when both 
utterances are spoken by the same speaker. 

The false alarm rates for male and female customers are shown in Fig. 
6. The customer false alarm rates are represented by vertical bars—one 
bar per customer for each pair of transmission systems. The percentage 
of time a customer was rejected varied for each customer in a group and 
also between groups. Although customers were asked to record their 
sentence the same way at each session, several had dramatic pitch 
changes. Since the subjects were not familiar with the customer voices, 
they tended to reject those customers. In general, the false alarm rates 
were fairly low and in many cases less than 10 percent. 

The miss rates for the male and female customers are shown in Fig. 
7. The percentage of time an imposter was accepted also varied greatly 
among customers. As seen in this figure, the miss rates were generally 
higher than 15 percent for all transmission pairs except for the LPC- 
ADPCM pair. 

An alternative way of displaying the information in the subject data 
is in terms of the likelihood ratio. The likelihood ratio, J, is a good mea- 
sure of signal detectability and is defined as 


_ P (hit) 
P (false alarm) ’ 


where 


P (hit) = 1 — P (miss). 
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Fig. 6—False alarm rates for male and female customers for all transmission system 
pairs. 


Figure 8 is a plot of the likelihood ratios for the male and female cus- 
tomers for transmission system pairs. The height of a vertical bar is the 
ratio of the number of times a subject correctly detected an imposter to 
the number of times a subject rejected a customer. Therefore, the larger 
the ratio (the higher the vertical bar), the better the subject performance 
for that transmission pair. As seen in this figure, the ratios are highest 
among the homogeneous systems for both male and female customers 
and lowest among the mixed systems. For several of the female cus- 
tomers, there were no false alarms so the likelihood ratios are infinite. 
Again, the large amount of variation among customers can be seen. 
Because of the high variation among the customers (as seen in the 
preceding figures), the data for the false alarm and miss rates were pooled 
on the basis of median error scores rather than mean error scores. The 
median errors of the eight customers for each pair of transmission sys- 
tems are shown in Fig. 9. Both a chi-square and a Fisher test!” were ap- 
plied to the median data to determine when significant differences ex- 
isted between (i) the male and female customer medians for each pair 
of transmission systems (no significant differences were indicated) and 
(ti) the combined male and female customer medians of each pair of 
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Fig. 7—Miss rates for male and female customers for all transmission pairs. 


transmission systems. The following significant differences were 
found. 

(t) The false alarm rate for mixed systems, that is, when the test and 
reference utterances were processed by different transmission systems, 
is significantly different from that of homogeneous systems. 

(it) The false alarm rate for the SP-SP pair was significantly different 
from all other transmission pairs. 

(tit) The miss rates for mixed and homogeneous systems were not 
significantly different except for two transmission pairs. The LPC-ADPCM 
system had a significantly lower miss rate than the other system pairs, 
and the LPC-LPC pair had a significantly higher miss rate than any other 
transmission system pair. 

Finally, Fig. 10 shows the overall error rates, that is, the average of the 
false alarm rates and miss rates, for each speech transmission pair. The 
overall error rate is between 10 and 20 percent for all speech transmission 
pairs. The lowest overall error rate is observed for SP-SP and ADPCM- 
ADPCM transmission pairs. There is no significant difference in the 
overall error rate for any of the system combinations. 
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IV. DISCUSSION OF THE RESULTS 


The results (i.e., the false alarm and miss rates for the speech trans- 
mission pairs) can be interpreted in terms of the dimensions of difference 
between transmission systems. There are three main differences between 
system pairs, as shown in Fig. 11. These are: 

(i) No difference—the same transmission system is used for both 
the test and reference utterances. The transmission system pairs in this 
category are SP-SP, ADPCM-ADPCM, and LPC-LPC. 

(11) One dimension of difference—the transmission system pair 
consists of one utterance transmitted over a clear channel and the other 
utterance processed over an ADPCM or LPC system. The transmission 
system pairs in this category are ADPCM-SP and LPC-SP. 

(uit) Two dimensions of difference—two very different transmission 
systems are used for the test and reference utterances. Only the LPC- 
ADPCM pair is in this category. 

In category (1), the median customer rejection (false alarm) rates were 
very low, and the median imposter acceptance (miss) rates were very 
high. This result reflects a subject bias toward responding SAME when 
the test and reference utterances are processed over the same trans- 
mission system. The low customer rejection rates in this category also 
indicate that subjects can easily verify customer-customer pairs. 
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Fig. 10—Overall error rates for all customers and for all transmission pairs. 


In category (ii), the median error rates were approximately the same 
for both customer-customer and customer-imposter pairs. For these 
cases, about 15 to 20 percent of the time a customer would be rejected 
and an imposter would be accepted. Whether a customer or an imposter 
was processed over either one of the transmission systems seemed to 
make very little difference. This result indicates that the subjects were 
confused by the pairing of an ADPCM or LPC system with a natural speech 
utterance. . 
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Fig. 11—Dimensions of difference between transmission system pairs. (a) No differ- 
ence—SP-SP, ADPCM-ADPCM, LPC-LPC. (b) One dimension of difference—ADPCM-SP, 
LPC-SP. (c) Two dimensions of difference—LPC-ADPCM. 


In category (iii) (i.e., LPC-ADPCM pair), the median customer rejection 
rate was very high and the median imposter acceptance rate was very 
low. The speech quality produced by the ADPCM and LPC systems is 
extremely different, and the results seem to reflect a subject bias toward 
responding DIFFERENT in this situation. 

The overall conclusion is that the speaker verification task by human 
listeners is easiest when homogeneous systems are used and is signifi- 
cantly more difficult when mixed systems are used for the test and ref- 
erence patterns. 


V. COMPARISONS WITH PREVIOUS WORK 


Since the male utterances used in Rosenberg’s experiment? were also 
used in this experiment, the male SP-SP error rates can be compared. The 
median false alarm and miss rates for the two experiments are shown 
in Table I. The median false alarm rate observed by Rosenberg was 3.3 
percent, which is about two times smaller than the error rate seen in this 
study. The median miss rate observed by Rosenberg was only 2.8 percent, 
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Table | — Comparison of male SP-SP median error rates 


Experiments False Alarm Rate Miss Rate 
Current experiment 7% 15% 
Rosenberg 3.3% 2.8%* 


* The average miss rate after completion of 25 percent of the listening sessions was six 
percent. 


which is considerably less than that observed here. Ideally, the error rates 
for the two experiments should be the same. However, several differences 
between the two experiments may have influenced the results. These 
are: 
(1) The bandwidth used in Rosenberg’s experiment was 4 kHz, 
whereas the bandwidth used here was 2.5 kHz. The 2.5 kHz bandwidth 
was required by the ADPCM system, which had a sampling frequency of 
6 kHz. 

(ti) The written instructions given to the subjects differed in both 
experiments. In Rosenberg’s experiment, the subjects were divided into 
two groups. One group was provided with instructions intended to lower 
the false alarm rate, while the other group was provided with instructions 
intended to lower the miss rate. In this experiment, all subjects received 
the same instructions with no intent to lower either type of error rate. 

(iit) In this experiment, no repeat judgments were obtained from any 
one subject, but in Rosenberg’s experiment, 32 repeated judgments were 
obtained from each subject for customer-customer pairs and 4 judgments 
were obtained for each customer-imposter pair. Even though there was 
no prior training in either experiment, Rosenberg noted a training effect 
imbedded in his data. He found an average miss rate of 6 percent after 
the completion of 25 percent of the listening sessions. This rate is ap- 
proximately two times less than the miss rate we observed. No drop was 
noticed with regard to the false alarm rate. 

(tv) The last factor that may have influenced the difference in the 
results is the fact that Rosenberg’s experiment consisted entirely of SP-SP 
test presentations. In this experiment, the SP-SP presentations were 
randomly combined with all other transmission pair presentations. 

Vi. SUMMARY ; 


The purpose of this experiment was to show the effect of different 
transmission systems on speaker verification accuracy by human lis- 
teners. It was shown that when the reference and test utterances were 
recorded from different transmission systems, the false alarm rate was 
significantly larger than when they were recorded from the same 
transmission system. However, with one exception, the miss rates were 
essentially equivalent, independent of the transmission system. As such, 
it is concluded that speaker verification by humans cannot be performed 
as accurately when different transmission systems are used. 
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Efficient Utilization of Satellite Transponders via 
Time-Division Multibeam Scanning 
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The space segment of a satellite system is proposed wherein a fixed 
number of identical transponders are shared among a larger number 
of spot beam regions which completely span a large total service area. 
Time-division multiple-access techniques are employed, and each 
transponder is rapidly scanned over appropriately defined group pairs 
of spot beam regions, thereby establishing full coverage and full in- 
terconnectivity. The service is matched to the nonuniform traffic re- 
quirements exhibited among the various spot beam regions, reliability 
can be optimized since all transponders are identical, and each tran- 
sponder ts utilized with an efficiency of 100 percent. A mathematical 
proof is presented which shows that the traffic can always be assigned 
on a nonconflicting basis, and an efficient assignment technique is 
described. 


1. INTRODUCTION 


The trend toward higher frequency communication satellites 
employing multiple spot beams affords significant capacity advantages 
relative to lower frequency, wide-coverage area systems, since the allo- 
cated spectral band can be reused in the various spot beams.!:2 When 
used in conjunction with digital modulation techniques and time-division 
multiple-access, the various coverage regions are readily interconnected 
via an onboard satellite switch operating in the time-division multiplex 
mode. In addition to the frequency reuse capability, the down-link 
transmitter power requirements are generally reduced because the an- 
tenna gain is higher than for a wide coverage area system. 

Despite these advantages, however, multiple spot beam satellites have 
several distinct drawbacks. These are generally associated with con- 
flicting requirements concerning reliability, coverage and blackout 
areas,‘ efficient transponder utilization, and nonuniform traffic density 
demands. 
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In the following sections we explore these conflicting requirements 
and review some partial solutions proposed to date. Then we present a 
space segment configuration for a satellite communication system which 
provides high reliability with a minimum of redundancy, access from 
any location within a wide service area, and up to 100-percent efficiency 
in transponder capacity utilization matched to an arbitrary nonuniform 
density of traffic demand over the entire service region. This system 
employs WN identical transponders which are shared on a time-division 
basis among M = N antenna ports spanning the entire service region. 
Starting with the traffic demand matrix, we give a mathematical proof 
that the desired arrangement is always possible, and present an assign- 
ment algorithm. 

Such a system might find applicability to a geosynchronous satellite 
operating in the 12/14-GHz band. From synchronous orbit, the 3-dB 
contour of a beam radiated from a 214-m antenna would cover about 1 
percent of the continental United States. Total United States coverage, 
then, would require about 100 such beams. Not only is the offered traffic 
nonuniformly distributed over the subregions, but within most such 
subregions the traffic is far too small to justify deployment of a dedicated 
wideband transponder to each. Moreover, from a practical viewpoint, 
the number of onboard satellite transponders is limited to the range of 
10 to 20 by weight, power, and cost constraints. Through proper time- 
division assignment, these 10 to 20 transponders can efficiently provide 
service to the entire United States. 


ll. PROBLEMS OF MULTIPLE SPOT BEAM SATELLITES 


A major problem in multibeam satellite design is one of transponder 
reliability. Unlike area coverage systems wherein the allocated band is 
divided among several transponders and service is provided via fre- 
quency division multiple access, it is desirable to serve each spot beam 
of a multibeam satellite system with a single transponder. With this 
approach, the required number of transponders is kept from becoming 
prohibitive, and the weight of the communications subsystem is mini- 
mized. However, sufficient redundancy must be provided to ensure high 
reliability for each transponder since single failures would preclude 
continuing service to the area serviced by that transponder. By contrast, 
for area coverage systems using frequency division multiple access, 
isolated failures merely cause a slight increase in the demand presented 
to the surviving transponders. 

A second problem in multibeam satellite systems concerns efficient 
utilization of the satellite transponders. In general, the traffic demands 
from the various coverage areas (or footprints) are nonuniform. Thus, 
to utilize each transponder fully, the capacity of each must be tailored 
to the traffic demand of the area covered by that transponder. A tech- 
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nique for achieving such a custom fit has been reported,® wherein the 
bit rate of each beam is selected as a fixed multiple of some basic rate. 
At the satellite, each uplink beam is demultiplexed into several basic rate 
bit streams, switched, and then remultiplexed into downlink beams. One 
disadvantage of this scheme is that onboard demodulation and remo- 
dulation is required. However, a more serious disadvantage in such a 
system is the need for nonidentical transponders, which precludes 
sharing a common pool of spare transponders among all beams, and the 
reliability of the system suffers.? 

A third problem of multibeam satellites involves means of accessing 
traffic from areas not within the footprint of some spot beam. Several 
solutions have been proposed,* involving sharing the spectrum between 
spot beams and an area coverage beam. These have the disadvantage 
that the area coverage transponders are different from the spot beam 
transponders and have higher power requirements to compensate for 
the loss of antenna gain. Also, the fixed spot beam transponders (as- 
sumed identical) are not matched to traffic requirements of the area 
served. 

Another solution to the access problem involves the use of a steerable 
spot beam which can be rapidly scanned across the entire service region 
via a phased array antenna, thereby providing universal coverage.® When 
used in conjunction with a multitude of fixed spot beams, the resulting 
hybrid system has the advantages of frequency reuse, high antenna gain, 
and identical transponders, and hence is the most attractive proposal 
among those reported to date. A similar system which provides for beam 
scanning by appropriate excitation of feedhorn clusters has also been 
proposed.’ However, such systems do not utilize the transponders effi- 
ciently, because of nonuniform traffic demands from the various areas 

covered. 


Ill. TIME DIVISION MULTIBEAM SCANNING SATELLITE 


To enable frequency reuse via a multibeam satellite system employing 
identical transponders, so that all transponders are used at maximum 
efficiency and a uniform grade of service is provided over the service area, 
we propose to generalize upon the scanning-beam approach. 

Consider a satellite employing N identical wideband transponders, 
each with a capacity or throughput of C units. The diameter of the sat- 
ellite antenna and the resulting beamwidth determine the number M 
of distinct footprints needed to provide service anywhere throughout 
the required service area. In general, M may be much greater than N, 
but in what follows we only require that M = N. 

The system traffic can be represented by an M X M matrix [t;;] as 
shown: 
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[tj] = (1) 
tm1 tme <-*: tum 


The element t;; represents the traffic originating in beam i and destined 
for somewhere in beam j. Each footprint might contain several ground 
stations, so ¢;; represents the sum of the traffic from all stations within 
beam i which is directed to stations within beam Jj. 

It is not necessary that the traffic matrix be symmetric, and a loop- 
back feature is possible, i.e., we do not require t;; = t;;, nor do we require 
ti; = 0. Of course, tj; 2 0. 

Two requirements must be imposed on the traffic matrix [¢;;]. First, 
since the total capacity of the satellite is equal to NC (N transponders 
each of capacity C), we require that 


M M 
T=> tga NC, (2) 
i=1j=1 
The second requirement is that the traffic originating from or destined 
for a particular beam should not exceed the capacity of one transponder, 
1.e., 


M 
Rowsum Rj = Sty SC 1 =1,2,...M (3) 
j=l 
M 
Column sum S; = >) ¢;,<C j=1,2,...M. (4) 
i=l 


The transponders are utilized with 100-percent efficiency when (2) 
is satisfied as an equality. This equation may be interpreted as estab- 
lishing the minimum number N of transponders required. Conditions 
(3) and (4) are necessary because no two transponders can be connected 
to a common spot beam (either uplink or downlink) on a noninterfering 
basis. 

If the total offered traffic equals the sum of the transponder capacities, 
we have the potential for 100-percent utilization. We will show that it 
is possible to interconnect the various uplink beams, transponders, and 
downlink beams such that this is achieved. We do this on a time-division 
basis by enabling each of the N transponders to access any of the M re- 
ceive (uplink) antenna ports and any of the M transmit (downlink) an- 
tenna ports. Figure 1 shows the use of two M X N crossbar type switches 
which enable any required interconnection. Alternatively, the appro- 
priate interconnections could be achieved by using N phased array an- 
tennas as shown in Fig. 2. 

It remains to be shown that all the offered traffic can be allocated 
among the N transponders on a noninterfering basis, i.e., at any instant 
of time, the N transponder inputs are each connected to a different re- 
ceive port, and the N transponder outputs are each connected to a dif- 
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Fig. 1—Satellite communication subsystem for rapid TDMA scanning of multiple 
transponders using two MX N crossbar switches. 


ferent transmit port. The theorem below guarantees that such an as- 
signment is always possible. 

Definition: A diagonal of a matrix [t;;] isa K-tuple D = {d1,dg, . . . dx}, 
where each member is a nonzero element of the matrix and no two ele- 
ments appear in the same row or same column of the matrix. The length 
of the diagonal is K (the number of elements) and the diagonal is said 
to cover the K rows and K columns from which the elements are 
taken. 

Theorem 1: Ina traffic matrix [t;;] for which T = 2M, 3! t;; equals NC 
and for which no row or column sum exceeds C, a diagonal of length N 
exists which covers all rows and columns which sum to C exactly (if 
any). 

The proof of this theorem is somewhat lengthy and is presented in the 
appendix. 

For convenience, we will assume that the elements tj; of the traffic 
matrix are integers, representing the traffic as multiples of some basic 
unit such as, for example, one voice channel. 

We shall assign traffic to the various transponders as follows: Let the 
TDMA frame consist of C time slots, each representing one unit of traffic. 
There are N such frames, one belonging to each transponder. In the 
traffic matrix [¢;;], find any diagonal of length N which covers all rows 
and columns summing to C (if any). Theorem 1 guarantees this is always 
possible. From these N diagonal elements, extract one unit of traffic from 
each and assign one unit to each of the N transponders. Since the traffic 
assigned to the transponders (for this time slot) originates from different 


SATELLITE TRANSPONDERS 2905 


PHASE PHASE 
SHIFTERS SHIFTERS 


7 Pa 
O) O) 

(*) Ot. 

/ N “ 

/ & TRANSPONDERS (+) N:1 COMBINER 
1:N SPLITTER | | | 
| 

| | | | 
| | | | 
OATH Pte 


ure 





E RECEIVE ANTENNA ELEMENTS 
E TRANSMIT ANTENNA ELEMENTS 


ie 1__ f 
E:1 A 
NER : 
Cone SPLITTER 






TDMA SLOT 
ASSIGNMENT 


Fig. 2—Satellite communication subsystem for rapid TDMA scanning of multiple 
transponders using receive and transmit phased-array antennas. Each transponder can 
be steered independently to M transmit and M receive spot beam regions. 


uplink beams and are directed to different downlink beams, then the 
traffic has been assigned on a noninterfering basis. 

Now since N units of traffic have been removed from the matrix, the 
reduced matrix has a total traffic of NC — N = N(C — 1) units. Fur- 
thermore, each transponder has C — 1 units of traffic carrying capacity 
left, and no row or column of the reduced matrix sums to more than C 
— 1. The latter is true because every row and column which summed to 
C in the original matrix has had one unit of traffic removed (because of 
the way the diagonal was constructed). 

At this stage, we have the same situation as we started with, except 
C — 1replaces C. By the same technique, we can assign another N units 
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of traffic to the next time slot in each transponder, and end up witha 
matrix with remaining traffic N(C — 2) in which no row or column sums 
to more than C — 2. Each transponder has then C — 2 time slots unal- 
located. Hence, we can repeat this procedure until all transponder time 
slots are used and no traffic remains unallocated. 

Thus, the nonuniform demands of a traffic matrix can be met by N 
identical transponders each operating at 100-percent utilization effi- 
ciency. We also note that, although the method described was for a 
matrix for which eq. (2) was satisfied as an equality (i.e., T = NC), it also 
applies to a matrix for which T < NC, because we can always pad such 
a matrix with dummy traffic® until T = NC. The assignments corre- 
sponding to the dummy traffic can be ignored, and simply reflect the fact 
that the available transponder capacity exceeds the demand. 

The assignments are not unique, and it may be possible to extract more 
than one unit of capacity per diagonal element at a time. This is desirable 
from a practical point of view as it minimizes the number of times the 
M X N switches have to be reconfigured during one frame period. To 
achieve this, it seems desirable to choose the N diagonal elements from 
large elements in the rows and columns with the largest sums, if possible. 
The maximum traffic extractable is ¢ = min (t1,t2), where t; = smallest 
element on the diagonal and C — fg is the largest row or column sum 
among the rows and columns not covered by the diagonal. 

As an example, consider the matrix below with N = 3 and C = 138. 


Downlink beam j 


3 
2 
0 
6 


1 
3 
6 
0 
2 
1 





= T 


In Fig. 3 we show the successive reductions of the matrix as the traffic 
is assigned to transponders. The diagonal elements chosen are circled, 
and rows and columns which sum to (the reduced value of) C are marked 
with an asterisk. 

The corresponding traffic assignments to the transponders are shown 
in Fig. 4. The switch must be reconfigured six times per frame for this 
solution. 


IV. CONCLUSIONS 


Although the system described has been presented in terms of sub- 
dividing the transponder capacity by time division, it is applicable to 
any other method of subdividing the transponder, e.g., by frequency 
division or by a combination of time and frequency division. In a fre- 
quency-division system, the smallest subdivision unit of capacity would 
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Fig. 3—Illustrative reduction of a 4 X 4 traffic matrix. The matrix contains 39 units of 
traffic, and there are three transponders, each of capacity 13. 


usually be larger than for a time-division system, and transponder lin- 
earity would be an important consideration as far as crosstalk is con- 
cerned. In a time-division system, transponder nonlinearities are more 
tolerable. 

For the system proposed, reliability of the transponders could be 
provided by the usual method of having a standby transponder for every 
transponder in use or perhaps sharing a standby with two operational 
transponders. However, an interesting alternative is to provide N’ > N 
transponders and use M X N’ switches at input and output. In this way, 
failed transponders can be excluded by simply modifying the switching 
sequence, and we have a pool of spare transponders which can be used 
to supply replacements for any that fail. 

The output switch at the satellite will generally operate at a high power 
level, and switching time may be a significant factor. In the phased-array 
realization, the equivalent problem is the time taken to steer the beam 
from one area to another. In either case, reducing the number of switch 
reconfigurations per frame will minimize any overhead time due to 
switching delays. Since the switching sequence, once decided upon, is 
not changed from frame to frame, a search for an efficient switching 
sequence is worthwhile. 

In practice, other considerations besides switching delays would also 
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Fig. 4—TDMA frame assignment for the example of Fig. 3. The numbers 1 through 4 
appearing within each frame correspond to the spot beam coverage areas 1 through 4 of 
the 4 X 4 traffic matrix. 


be of importance. One additional consideration would be the interference 
between stations in adjacent beam-coverage areas. This interference 
would be reduced by ensuring that the stations did not transmit or re- 
ceive during the same time slot, or by using different polarizations. Al- 
ternatively, adaptive interference cancellation can be performed for the 
phased-array implementation. Constraints imposed by considerations 
such as these, however, may result in more than the minimum number 
of transponders being required to transmit the traffic. 

We have described a system which enables efficient utilization of 
transponder capacity, while at the same time providing service over a 
wide area with a uniform grade of service, identical transponders in the 
satellite which can be coupled with a very efficient standby method for 
maintaining transponder reliability, and a system which can be adapted 
to changing traffic demands by simply altering the switching sequence 
at the satellite. 


APPENDIX 
Proof of Theorem 17 


The definition of a diagonal and its length are found in the main 
text. 

The proof is approached by first establishing a number of lemmas 
which can then be used in the proof of the theorem. 
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The theorem as proved here is slightly more general than that pre- 
sented in the main text, since the existence of a diagonal of the required 
type is demonstrated for (N — 1)C < T < NC rather than just for T = 
NC. . 

Lemma 1: If ina matrix [t;;] there exist diagonals D; = {a1,a2,... an} 
of length N and Dz = {b,bo,... br} of length L = N, then it is possible 
to construct: 
(1) A diagonal Ds of length L which covers all the rows and columns 
covered by Dy. 
(11) A diagonal D4, of length = L which covers the rows D, and the 
columns of Do. | 

(ut) A diagonal Ds of length = L which covers the columns of D, and 

the rows of Do. 

Proof: 
(i) Form disjoint sets S,,,m = 1,2,...from the elements of D; and 

Dz as follows: 

(a) Once an element has been allocated to a set, it is not con- 
sidered further. 

(b) To form set S,,, choose an initial element from among those 
not yet allocated to a set. 

(c) If b; is on the same row or column as an a; € Sm, then ; € 


me: 
(d) If a, is on the same row or column as a 0; € S,, then a, «€ 


a 
(e) Continue adding elements to S,, using (c) and (d) until no 
more can be added. If unallocated elements still remain, 
form a set S,.41 starting at step (b). 
(11) The sets have the property that an element from one set cannot 
share a row or column with an element from another set. 
(tit) The sets are of the following varieties: 
V1: Sets with equal number of elements from D, and D» which 
cover the same rows and columns e.g., a, b coincident, 





V2: Sets with equal numbers of elements from D, and Dz which 
cover the same rows, but not the same columns, 


eg., a — b a - 
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V3: 


V4: 


V5: 


Sets with equal numbers of elements from D; and D2 which 
cover the same columns, but not the same rows, 


e.g., a a b 


Sets with one more element from D, than from Dg, in which 
the elements from D, cover all the rows and columns of the 
elements from Do, 


{ 


e.g., a a — b a 


a 
| 
a b 


U 
Q—_—— 


—- @ 


b-—a 


eo" 


Sets with one more element from D2 than from Dj, in which 
the elements from Dz cover all the rows and columns of the 
elements from D, 


e.g., 6b 6b - 


a-—b 


Not all varieties need be present, but it is easily seen that there must 
be at least L — N sets in V5. 


Let V5’ = {any L — N sets from V5}. 


Dg = {a; £V5,b; € V5} 
D, = {a; € V,0; € Vo,Q; € V3,Q; € V4,0; € Vs} 
Ds = {a; € Vi,a; € Vo,b; € V 3,0; € V4,0; € V3} 


are diagonals of the type required. 
Lemma 2: If the maximum length of a diagonal in a matrix [tj| is N, 
and the row and column sums do not exceed C, then 


Ty =) di ty = NC. 
cr 3 
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Proof: If all the nonzero elements are in at most N rows (or columns), 
then by summing over these rows (or columns) we obtain Ty < NC. 
We therefore need only consider the case where there are more than 
N rows and more than N columns which contain nonzero elements. 
Let D, = {a1,a9,... ay} be a diagonal of length N covering columns 
JiJ2,..-Jn- There must be another column jy+; with a nonzero element 
x. One of the elements of D,, say a,, must be on the same row as x, oth- 
erwise {a1,@9,... @y,x} would form a diagonal of length N + 1. 
Remove the row containing x and a, from the matrix. We will show 
that the reduced matrix has a maximum diagonal length of N — 1. 

In the reduced matrix we have Dj = {ay,... @-—1,0-41,... ay} of length 
N — 1 and suppose there is also a diagonal Do = {b1,b9, .. . bx} of length 
N (in the reduced matrix). By Lemma 1 we can find from D; and D2a 
diagonal Ds of length N which covers all the rows and columns covered 
by D;. Since N — 1 of the elements of D3 are in columns 
Ji.++Jr-bJr+1-+-JNn, then both columns j,,j41 cannot be covered by 
the Nth element. Hence, D3 augmented by either a, or x would form a 
diagonal of length N + 1 in the original matrix. Hence, no such diagonal 
Do exists. 

The reduced matrix satisfies the same conditions as the original matrix 
except N — 1 replaces N. 


Ty=Ty-1+RSTy-1+C, 
where 
Ty = sum of elements in original matrix 
Tn-, = sum of elements in reduced matrix 
R = sum of elements in row removed < C. 


Hence, Ty < NC if Ty_; < (N — 1)C. Since To = 0, an inductive 
argument establishes the result. 
Lemma 3: In a matrix [t;;] for which the row and column sums do not 
exceed C, and for which T = 2;2;t;; satisfies (N —1)C < T < NC for 
some integer N, there exists a diagonal of length L = N. 
Proof: Let L be the maximum diagonal length. 

By Lemma 2, T < LC 

But T >(N-—-1)C,soL >N —1, 

Hence, a diagonal of length L = N exists. 
Lemma 4: Ina matrix [ti] for which the row and column sums do not 
exceed C, and for which T = 2; 2;t;; satisfies (N — 1)C < T < NC for 
some integer N, there exists a diagonal of length N” = N which covers 
all rows and columns which sum to C exactly. 
Proof: The submatrix consisting only of the P rows which sum to C has, 
by Lemma 3, a diagonal D, = {a1,a9,... ,ap} of length P, because its el- 
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ements sum to PC. This diagonal covers all the P rows summing to C. 
Note that P < N. 

By Lemma 3, the original matrix has a diagonal Dz = {b1,b9, ... ,bn} 
of length N. By Lemma 1, we can construct from D,; and D» a diagonal 
Dy of length N’ = N which covers all the columns of Dz» and all the rows 
of D4. 

Let D; = {a}, ... ,@Q} be a diagonal of length Q of the submatrix con- 
sisting only of the Q columns which sum to C exactly. Note that Q < N 
=< N% 

Then by Lemma 1 we can construct from D} and D;a diagonal D5 of 
length N” = N’ which covers all the columns of Dj and all the rows of 
Dy (and hence all the rows of D). 

Hence, Ds covers all the rows and columns which sum to C exactly. 
Theorem: In a matrix [t;;| for which the row and column sums do not 
exceed C, and for which T = 3; 2;t;j satisfies (N — 1)C < T < NC for 
some integer N, there exists a diagonal of length N which covers all rows 
and columns which sum to C exactly. 

Proof: By Lemma 4, a diagonal D, = {a1,d9,... az} of length L = N exists 
which covers the P rows and Q columns which sum to C exactly. 

Divide D, into disjoint subdiagonals S,, So, and S3 with Lj, Lo, and 
Lz elements, respectively, with L; + Lo + L3 = L. 

S1 = {elements of D, in both a row and a column summing to C} 

So = {elements of D, in either a row or a column summing to C, but 
not both} 

S3 = {elements of D; in neither a row nor a column summing to C} 

If L, + Lo < N, then a diagonal consisting of the L; + Ly elements 
from S; and Sg plus any N — L, — Le elements of S3 is a diagonal of 
length N covering all rows and columns summing to C. 

Hence, we need only consider the case L; + Ly > N. Note that P+ Q 
= 2L,+L2>N also. 

Consider the submatrix consisting of P rows and Q columns containing 
only those elements (including zero elements) which lie in both a row 
and a column summing to C. We know §, is a diagonal of length L, of 
this submatrix. The sum of the elements in the submatrix = T’ = {sum 
of elements of original matrix in those P rows} — {sum of elements of 
original matrix in the same P rows, but which do not lie in the columns 
summing to C}. Hence, T’ = (P+ Q — N)C, since the first sum is PC and 
the second cannot exceed (N — Q)C. 

By Lemma 3, the submatrix has a diagonal S,4 of length P+ Q—-—N= 
2L,+Le—-—N>L;, (since L; + Lo > N). By Lemma 1 we can construct 
from S, and S, a diagonal S} of length 2L,; + Lz — N, which covers all 
the rows and columns covered by Sj. | 

Now S{ covers L; + Lz — N rows summing to C and L; + Lz — N col- 
umns summing to C not covered by S; and which must have been cov- 
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ered by So. Hence, form a new subdiagonal S, by deleting from S»2 the 
elements in these rows and columns. Then Sjand S3 cover different rows 
and columns. 

Now S; is a diagonal of length L; = 2L, + Ly — N and S; is a diagonal 
of length Ls = Ly — 2(L, + Lz — N). Thus, the elements of S; and S, form 
a diagonal which covers all the rows and columns which sum to C and 
its length is L} + Ly = N. 
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On Blocking Probabilities for 
a Class of Linear Graphs 
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(Manuscript received October 20, 1977) 


In a t-stage linear graph, all vertices are arranged in a sequence of 
stages such that any edge goes between a vertex in stage i and a vertex 
in stage i + I for some i. Lee first proposed the use of t-stage linear 
graphs for studying the blocking performance of t-stage switching 
networks. In his model, each edge is in either of two states, busy or idle, 
and the states of the edges are independent. Furthermore, an edge 
connecting a vertex in stage 1 to a vertex in stage i + 1 has the constant 
probability p; of being busy. In the current paper, we use Lee’s model 
to compare the blocking probabilities of different linear graphs. In 
particular, a t-stage linear graph is said to be superior to another t-stage 
linear graph if the blocking probability of the former never exceeds that 
of the latter for any choice of the p;. For a class of linear graphs known 
as SP-canoples, we give simple necessary and sufficient conditions that 
one t-stage linear graph is superior to another. 


I. INTRODUCTION 


We consider a ¢t-stage linear graph with a source (the vertex of the first 
stage) and a sink (the vertex of the last stage). All vertices are lined up 
in a sequence of stages such that any edge goes from a vertex in stage 1 
to a vertex in stage i + 1 for some 1, while each edge can be in either of 
the two states, busy or idle. The linear graph is said to be blocked if every 
path joining the source and the sink contains a busy edge. Lee! first 
proposed the concept of linear graphs in his study of the blocking per- 
formance of switching networks. We use Lee’s model and follow his in- 
dependence assumptions, namely, that the probabilities of occupancy 
for edges being busy in successive stages are independent. Thus, we may 
assume that any edge connecting a vertex in stage i with a vertex in stage 
t + 1 has some probability p; of being busy for 1 <1 < t — 1. The se- 
quence (p1,D9,-,0¢—1) will be called link occupancies of the t-stage linear 
graph. A linear graph is said to be superior to another if, for any given 
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link occupancies, the blocking probability of the first graph does not 
exceed that of the second graph. 

In this paper, we restrict ourselves to a special kind of linear graph, 
called an SP-canopy. By definition, the smallest SP-canopy is a series 
combination of two edges. Any other SP-canopy is either a parallel 
combination of two smaller SP-canopies or a series combination of a 
smaller SP-canopy and an edge. For readers familiar with graph-theoretic 
terminology, an SP-canopy can be viewed as the union of two rooted trees 
with identified sets of terminal nodes such that the union is a planar 
graph (see Fig. 1a for an example). 

Let e be an edge from a vertex a in stage i to a vertex b in stage i + 1. 
We define \(e) to be the ratio of the outdegree of a to the indegree of b. 
If all A(e), where e ranges over all edges between stage i and stagei + 1 
(for a fixed i), have the same value ); for 1 <i < t — 1, then this linear 
graph is said to be a regular linear graph (see Fig. 1b). Thus, a regular 
linear graph is associated with a unique degree sequence (Aj,A9,***,Az¢—1). 
In the case in which the regular linear graph is an SP-canopy, it can be 
uniquely represented by the degree sequence. Define \* = max; <;<t-1 
{A\1AgA;}. It is easy to verify that \* is just the number of distinct paths 
from the source to the sink. A regular SP-canopy is said to be a symmetric 
SP-canopy if A;\;-; = 1 for 1 <i <t — 1. Thus, the degree sequence of 
a symmetric SP-canopy can be written, abbreviated as (A1,-++,A,(¢-1)/2]), 
when [ x | denotes the greatest integer not exceeding x. The linear graph 


(b) 


(c) (d) 


Fig. 1—(a) An SP-canopy. (b) A regular linear graph. (c) A regular SP-canopy. 
(d) A symmetric SP-canopy. 
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in Fig. le is a regular SP-canopy, and the linear graph in Fig. 1d is a 
symmetric SP-canopy. 

We say a degree sequence ()1,A9,-*,Az-1) Majorizes another degree 
sequence (Aj, Ao,*)A;—-1) if and only if AyAgeA; = AjAoA; for every i, 1 
<i <t —1.Ifwe consider the set S; + of all regular canopies with a fixed 
number t of stages and a fixed number X* of distinct paths, we see that 
(A*,1,+-,1,(A*)—!) majorizes the degree sequence of any other regular 
SP-canopy in S; y+. In Ref. 2, comparisons are made involving all sym- 
metric SP-canopies with fixed t and \* for t odd, with the conclusion that 
the symmetric SP-canopy with the degree sequence \*,1,--,1((¢ — 1)/2 
1s) is superior to all the others and the symmetric SP-canopy with degree 
sequence 1,-,1,A* ((t — 1)/2 1s) is inferior to all the others. In this paper, 
we prove a stronger and more general result which says that one regular 
SP-canopy is superior to another one if and only if the degree sequence 
of the first one majorizes the degree sequence of the second one. 


ll. SYMMETRIC SP-CANOPIES 


In this section, we study symmetric SP-canopies with degree sequences 
of the form (Aj,A9,°*,A| (t-1)/2])- 

First, we prove a few auxiliary lemmas needed in the proof of the main 
result. 
Lemma 1: Define 


F(x) = (1 — a(1 — b*))*/*, where 0 <a, b < 1. 


Then F(x) ts monotone nondecreasing for 1 <x <k. 
Proof: We consider several cases: 
Case 1:0 <a,b <1. 


dF k 
Fy 2) = (all — be) - (— FInd — a(t — b)) 


kab ln b 
x(1-a(1- ay) 
We define 
1 ab*ln b 
G = —— In (1 —a(l1 — b*)) + 
(a) = —~ In L— al )) fatorD 


It is easy to verify that G(0) = G(1) = 0. Furthermore, by setting dG/da 
= 0, we obtain the unique solution ao which satisfies 


1 fs xb*In b 
1-—b* (1—5b*)2" 
Since d2G/da? (ao) < 0, ag is indeed a maximum. Therefore, G(a) > 


0 for allO <a <1. Thus dF/dx is positive for 0 <a, b < 1. 
Case 2:a = Oorb = 1. 


We have F(x) = 1 and dF/dx = 0. 





ago = 
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Case 3:a = 1, then 
F(x) = b*, dF/dx = 0. 
Case 4:a £ 0,1 andb=0 
F(x) = (1 —a)F, 


OE Gs Gn 
dx x? 


In any case, dF /dx is nonnegative. Therefore, Lemma 1 is proved. 

For a given vector a = (a1,Q9,"**,:-1), 1 2 a; = 0, and a sequence of 
positive real numbers ((1,--,6)(¢-1)/2|), we define the following func- 
tion: 

(1 — ag-1y2 -aet/2)le-vel if t is odd 
A(Bi@-/2)) = re ae 
(1 — ag/2)PLe-0/2) if t is even. 


and 


f(Bi,Bi+ 19°48) 0-1/2) = (1 — ajar-i(1 — f(Bi41,*,8) e-1)/2)))) % 


for i = 1,2,--,] (¢ — 1)/2] —1. 
Lemma 2: If the sequence (61,69,-*,8) (¢-1)/2|) Majorizes the sequence 
(81,8 2,++,B, ¢-1/2))) then we have 


f (81,828, (¢-1)/2)) S f(B1,B 2B) e-1/2))- 


for any vector (a},*,a;~1) satisfying 1 = a; = 0 for alli. 
Proof: It is easily checked that Lemma 2 is true for t = 2 and 3. By the 
induction assumption, we have 


f(B182/61,83,**,{ (e-1)/2) S f(B2583,**,8} ¢-1)/2))s 
since (81B2/8',83,**,81 (¢—1) /2) majorizes the sequence 
(B5,83,***,8) (t—-1)/2))- 
It is also clear that the following holds: 
f(81,81B2/81,83,,8,(e-1)/2)) S (81,8258) 0-1/2) 


Therefore, it suffices to show 
f (81,8258) (¢-1)/2)) S £81,818 2/8 1,83, B(e—1)/2))- 
Now we have 
f(B1,82,°,8)(¢-1))) = (1 — aaz—1(1 — f (89,8) ¢-1)/24))) 
= (1 — ayay—1(1 — (1 — agae—o(1 — f(B3,9°,8) (e-1)/2)))82)) 82. 
We set 
a= Q@1at-1, 
b= 1 — agaz—2 (1 — (83,8, ¢-1)/2))), 
k = ByBe. 


2918 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


Then we have 


f(B1,B2,*B51 (¢-1)/2)) = (1 ae a(1 = b62))k/B2, 


Similarly, 


f(81,8182/61,635**,8) (¢-1)/2)) = (1 — a (1 — 5182/81) 1k/8 182, 
Since B2 < 8182/6), by Lemma 1 we have the following: 


f(B1,82,*,8) (¢-1)/2)) S f(B1,8182/B81,83,-58) (¢-1)/2)) S f(B1.B2s°*B1 e-1/2)> 


and Lemma 2 is proved. 

Theorem 1: Consider two t-stage symmetric SP-canopies with degree 
sequences (A4,*,\ (t—1)/2|) and (Ajz,*",A(¢-1)/2)) respectively. Then the 
first linear graph is superior to the second if and only if the sequence 
(A1,",AL(¢—1)/2]) Majorizes the sequence (Aj,,\ (:-1)/2))- 

Proof: If we let a; = 1 — p;, for 1 <i < t — 1,in Lemma 2, it is easy to see 
that the blocking probability P(\1,--,y (¢—-1)/2)) for the symmetric SP- 
canopy with degree sequence (Aj,++,\;(t—-1)/2|) has the same value as 
fa," ALE 1)/2))- Thus, the fact that (Aj,-~ AL(t— 1)/24) majorizes 
(Ay, te ~1)/2\) implies the symmetric SP-canopy with degree sequence 
(Az,-**,AL (¢—-1)/2}) is superior to the symmetric SP-canopy with degree se- 
quence (Aj,*,Aj(¢-1)/2|) 

We also want to show that, if the symmetric SP-canopy with degree 
sequence (Aj,-*,Aj (¢— 1)/2 |) is superior to the symmetric SP-canopy with 
peeree sequence Ons SALE y/2 p, then it is necessary to have (i, 

--, Mf (t—1)/2]) Majorizing (Aj,+ SAL(tH 1)/2}). Suppose, on the contrary, that 
tex exists an integer k, 1 < k <|(t — 1)/2| such that 14%, \; < 12, 
\;. We consider the link occupancies (p1,-,p;—1), Where pj = Pt-i = 
l-—eifl<is<kandp;=p;_;=0ifk <i <| (¢ — 1)/2|. Then for « suf- 
ficiently small, we have 

P(A1,%*,Ay (¢—1)/2]) = POt1,-*,Ax) 


= (1 — (1 — P(A, An))™ 
=l1- (Axe? + O(e4))(1 mas P()a,*,Az)) 


Rk 
=l1- e2k Il Ni + 0(e2" +2), 
i=1 


Similarly, PQ; oe M(t 1/2) is approximately 1 — «?” II, d;. Thus we 
have P(\j,-« A(t 1/24) < P(A1,,A, ¢-1)/21)- This contradicts the fact that 
the symmetric SP-canopy with degree sequence ()j,-+,Ay(¢—1)/2]) is su- 
perior to the symmetric SP-canopy with degree sequence (Aj,-,Aj (¢-1)/2)); 
and Theorem 1 is proved. 
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lll. REGULAR SP-CANOPIES 


From definitions in Section I, it can be easily seen that a regular SP- 
canopy is either a parallel combination of copies of a smaller regular 
SP-canopy or series combination of a smaller regular SP-canopy and an 
edge. A regular SP-canopy has many special properties, described in the 
following lemma. 

Lemma 3. Let (\4,+,A¢-1) be the degree sequence of a regular SP-canopy 
G. 

(i) AyAy-1 Is etther an integer or the reciprocal of an integer. If 
AiAz-1 Is an integer, G is a parallel combination of copies of a 
smaller regular SP-canopy as shown in Fig. 2a, where G’ has 
degree sequence (A,A¢~-1,A2,°*,\t—2). If A1Ag—1 Is the reciprocal 
of an integer, G is a parallel combination of copies of a small 
regular SP-canopy as shown in Fig. 2b where G” has degree se- 
quence (Xg,*+,A¢—2,At—1A1)- 

t—1 

(ii) JJ] A;=1. 
i=1 

(iii) If \y > lforsomek,1<k<t-—1,thend; =1foralli <k. If Xp 
<1forsomek’,1<k’ St —1,then 1 S1foralli>k’. 

Proof: Since G is a regular SP-canopy, the configuration of G can be easily 
shown to be either as in Fig. 2a or as in Fig. 2b. If G is as in Fig. 2a, G is 
a parallel combination of k copies of a regular SP-canopy (for some k) 
which is a series combination of G’ and an edge. Let G’ have degree se- 
quence (Aj,A9,*,A;—2). It is clear that the degree sequence of G is 
(RYg AgsAt—9,2 4) and AyAz—-1 = (RAi)R~! = 2; is an integer. If G is as 
in Fig. 2b, G is a parallel combination of k’ copies of a regular SP-canopy 
(for some k’) which is a series combination of an edge and G”. Let G” 
have degree sequence (Aj,A5,-*,A;—9)- It is easy to see that the degree se- 
quence of G is (k’,\y,Ao,**Ap—9(R’)71) and AyAg—1 = Rk’ (Ap-9(R’)7}) = Apo 
is the reciprocal of an integer. Since one of the two cases must occur, (7) 
is proved. 





Fig. 2—Regular SP-canopy G. 
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We may assume without loss of generality that G is as in Fig. 2a. By 
the induction assumption, we have 
Il‘=? \; = 1. Thus, we have 


t-1 7 a ee t-2 | 
ia d= kM (TI ni) k-t= TT N= 1, 
i=1 1=2 1=1 


and (it) is proved. 

Finally, \; being an integer implies \;, = d» is an integer if k > 1. By 
the induction assumption, )’;,j = 1,,k — 1, is an integer. Therefore, \1 
= k); is an integer and A; = 0/;,j = 2,-,k integer. The other half of (iiz) 
can be similarly verified. This proves Lemma 3. 

We note that (iz) holds for the degree sequence of any regular linear 
graphs. However, (z) and (iii) are not true for series-parallel, regular 
linear graphs. 

For a given vector, a = (a1,,a;-1), 0 S a; < 1 and positive real 
numbers ((1,-*:,6;-1), we define the following function g by 


8 (Bi,82—15) 
_ (C1 = ay (1 — g(Bo,+,82-1813a-2,2)))9! if 618-1 1, 
. la =O 8 (B1B1-1,825°**81—2302—9,1)))/Pe-1 otherwise, 
where az; is the vector (a;,,aj+-1) and g(f;3a1;) = 1 — aj. Note 
that 
8 (B18e-13) = 8 (Bi=1,82-2,°°,82 81 5a) 
where 
@ = (ay—1,0¢—2,",012,01). 
Lemma 4: For any vector a = (a},-**,a—-1), if the sequence (81,82,-,84—1) 
majorizes the sequence (},85,«-8;-;), then we have 
8 (B1,B2,8e-15a) S 8(B1,B9,+81-150). 

Proof: It is easy to see that Lemma 4 is true for ¢ = 2. When t = 3, we 
have 

& (81,8230) = (1 — ayarg)*1 S (1 — ayarg) = (81,8550). 


It suffices to consider t = 4. We note that 

(81,82,-+,8¢—-1) majorizes (6},-,8;—1) if and only if (6;41,--,A9 1,67 ') majorizes 
((Bt-1) 1 -*,(82)71,(61)7}). 

Therefore we may assume, without loss of generality, that 618,-1 < 1 (we 
may consider the inverse sequence otherwise). Then we have 


&(B1,°°,8:-130) = (1 ~ ay(1 4 8 (Bo,**,8¢—2,81-18 30t—2,2) ) 1. 


Let us consider several possibilities for B,6;-1. 
Case (i): B18;-1 < 1. 
Since B; < 61, and 6}6;-1 < 1, we have 
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8 (61,818 2/B1,83)*81-130) 
= (1 — ay(1 — g(818o/8},83,-*,8 1-18 304-2,2))) 9s. 
We want to show that 
8(B1.,Bt—-150) S 8(81,8182/81,83,-,82-150)- 


We note that B2(8:-181) = (8182/6) (6:-18,). It suffices to consider the 
following two cases. 


(a) B18 28:-1 <1. 


&(B1,*,8¢—-13a) 
= (1—ay (1 — (1 — ae(1 — g(B3,+,61-2,81-181 8 2304—3,3))) 82) ) Ft 


and 


2(61,8182/61,83,**:bt—130) 
= (1 — ay(1 — (1 — ae(1 — g(83,+,81-2,81-181B23a2—3,3)) ) 9192/61) )P1 


By using Lemma 1 and the fact that B2 < 6182/6}, it can be easily seen 
that 


8 (B1,°*,82-130) = & (81,818 2/61,83,*81-150). 
(b) 618 28,-1 > 1. 


The proof is similar to that of Case (a), and the proof is omitted. 
The next step is to show that 


8 (81,818 2/81,B3,*°sBt—-150) S 8 (81,805,811). 
We note that 6;8;-; < land 
8(Bi,°,8;-13a) = (1 — an(1L — 8 (Bo*+81—-18150¢e—2,2))) 44. 
Because ((3162/6},83,++,81-181) majorizes ((82,63,,8;—2,8:-181), we have 
8(Br,*°B1—-150) S 8(B1,8182/81,83)--,82-130) 
S g(B1,B2,,B1-13). 

Case (ii): 6;6;-1 > 1. 

Since (81,82,-+,8:-1) majorizes (81,82,-+-,8:-26+-161,81 |) and @16;" = 


1, we have, from Case (i), that 


& (B1,B2,+,82-150) S 8(B1,B2,+,81-281-181,817 15a)). 
It suffices to show that 


&(81,82,-*,82-281-181,87 30) s 8 (81,8258 4-130). 
It is easy to see that 


(81,(8:-281-181) ~},8723,**82 4,81) majorizes 
((8:-1)~1,-+;(82)-1,(8) —2) 


2922 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


and 8187" < 1, (8;-1)-"(6;)-! S 1. 
From Case (1) we have 
8(B1,(8:-161-161) 8 7213)°*82 81 3a) s 2((Bi-1) 7}, «+, (Bo) 74, (8) “43a) 


where @ = (at-1, OLt—2,°**, 2,01). 
Therefore 


&(B1,B80,°,81-28+—-18 1.87 3a) = 8(B),Bo,**,8;—-13) 


and Lemma 4 is proved. 

It is easy to see that the blocking probability P(\y,-,A¢-1) of a regular 

SP-canopy with degree sequence (A1,-**,Az-1) for any link occupancy a 
= (a1,*,a;-1) has the same value as g(Aq,-**,A¢-1;a). By using Lemma 4, 
the following theorem can be proved by techniques similar to those used 
in the proof of Theorem 1. 
Theorem 2: Consider two t-stage regular SP-canoptes with degree se- 
quences (Aj,*,Az—1) and (Xj4,*,Az-1), respectively. Then the first linear 
graph ts superior to the second if and only if the sequence ()1,***,A¢—1) 
majorizes (X4,,A;—1)- 


IV. REMARKS AND EXAMPLES 


In Fig. 3 we list a few examples. The degree sequence of the symmetric 
SP-canopy G, in Fig. 3a is (3,2,1,2—!,3—!). The degree sequence of the 
symmetric SP-canopy Go in Fig. 3b is (2,3,1,3-1,2-!). The degree se- 
quence of the regular SP-canopy G3 in Fig. 3c is (2,3,37!,1,2—!), and the 


oe 
> 


(c) (d) 
Fig. 3—Examples. 
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(a) (b) 





Fig. 4— Linear graphs. 


Fig. 5—Series-parallel, regular linear graph. 


degree sequence of the regular SP-canopy in Fig. 3d is (2,3,371,274,1). 
By Theorems 1 and 2, we note that, since (3,2,1,2-1,3-1) majorizes 
(2,3,1,3-1,2~!) and so forth, then G, is superior to Gg, which is superior 
to G3, which is superior to G4. 

Although the result in this paper only involves SP-canopies, they can 

easily be generalized in the following ways: 

(t) Every edge between stagez and state i + 1 can be interpreted as 
a linear graph G;. 

(it) Some linear graphs with multiple edges can be viewed as SP- 
canopies by adding imaginary stages or vertices so that we could 
then apply our results. For example, in order to compare linear 
graphs in Figs. 4a and 4b, we put an imaginary stage between 
the second and third stages. The resulting linear graphs are 
shown in Figs. 4c and 4d, respectively. 
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Suppose m = n for the linear graphs in Fig. 4. It is easily seen that the 
degree sequence (m,n,(mn)~!) of the linear graph in Fig. 4c majorizes 
the degree sequence (n,m, (mn)7!) of the linear graph in Fig. 4d. Thus, 
we know that the linear graph in Fig. 4c is superior to that in Fig. 4d, and 
we may therefore conclude that the linear graph in Fig. 4a is superior to 
that in Fig. 4b. 

More generally, we may consider the class of all series-parallel, regular 
linear graphs. For example, the linear graph in Fig. 5 is a series-parallel, 
regular linear graph but not a regular SP-canopy. 

Is it true that a series-parallel regular linear graph is superior to an- 
other if its degree sequence majorizes the degree sequence of the other? 
We conjecture this is true. However, it seems that it cannot be proved 
by the methods we used in this paper. 
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An Analysis of 16 kb/s Sub-Band Coder 
Performance: Dynamic Range, Tandem 
Connections, and Channel Errors 


By R. E. CROCHIERE 
(Manuscript received December 26, 1977) 


In this paper, we examine the performance of sub-band encoding 
under a number of constraints which can exist in practical digital 
communications systems. In particular, we investigate the effects of 
varying input signal levels, tandem connections, and channel errors 
on the performance of sub-band coders. A coder bit rate of 16 kb/s is 
used in all the simulations. The dynamic range performance is evalu- 
ated for a 50-dB range of input signal levels. Tandem connections of 
up to four sub-band coders in tandem are examined. Finally, the effects 
of random channel errors on the performance of sub-band coders is 
examined for bit error probabilities of up to 10 percent. A robust coder 
design with partial bit error protection is also proposed for use in very 
high channel error environments. 

Three different performance measures were used in these simula- 
tions, the conventional signal-to-noise ratio, a segmental signal-to-noise 
ratio, and an LPC distance measure. By comparing the results of these 
various performance measures and from informal assessments of 
subjective quality, we gain some new insights into the advantages and 
disadvantages of these measures in terms of their usefulness in pre- 
dicting coder quality. 


Il. INTRODUCTION 


Sub-band coding has recently been proposed as a technique for ob- 
taining relatively good quality digital speech at a bit rate of 16 
kb/s.1-3.16 This quality is subjectively comparable to that of 24 kb/s 
ADPCM (adaptive differential PCM),!? and it is generally acceptable for 
some types of digital communications applications where relatively low 
transmission rates are required. 

In a practical communications system, the quality of digital speech 
can be affected and degraded by a number of factors. The input speech 


2927 


levels to the coders may vary over a relatively broad range, and the coders 
may not necessarily be driven at their optimum input levels. In a com- 
munications network, digital coders may be linked with other types of 
digital or analog systems, and it is possible that several tandem con- 
nections of the same type of coder may occur in a given transmission 
path. Finally, channel errors may occur in a digital system, and it is 
important to understand how the performance of digital coders are af- 
fected by these errors. 

In this paper, we present the results of a series of experiments designed 
to assess the performance and robustness of 16-kb/s sub-band coding 
in practical communications environments. The dynamic range of the 
coder is evaluated over a 50-dB range of input signal levels. Tandem 
connections of sub-band coders of up to four links are examined. The 
effect of channel errors is examined for error probabilities as high as 0.1. 
Finally, several methods for improving the robustness of sub-band 
coding in the presence of channel errors are examined. 


li. THE SUB-BAND CODER 


Sub-band coding is a waveform coding technique in which the speech 
band is partitioned into typically four or five sub-bands by bandpass 
filters. Each sub-band is then lowpass-translated to dc, sampled at its 
Nyquist rate, and then digitally encoded using adaptive PCM (APCM) 
encoding. By this process of dividing the speech band into sub-bands, 
each sub-band can be preferentially encoded according to perceptual 
criteria for that band. On reconstruction, sub-band signals are decoded 
and bandpass-translated back to their original bands. They are then 
summed to give a replica of the original speech signal. 

A particularly attractive implementation of the sub-band coder, in 
terms of hardware, is based on an integer band sampling approach.!? 
This approach uses the samplers both for discretizing the sub-band 
signals as well as for doing the lowpass and bandpass translations, i.e., 
the modulation is achieved with an impulse train instead of with sine 
and cosine signals. This implementation is illustrated in Fig. 1. Bandpass 
filters BP; to BPy in the transmitter and receiver serve to partition the 
input speech into sub-bands. The coders and decoders encode the sub- 
band signals and the multiplexer combines these digital signals into a 
single bit stream for transmission over the digital channel. In addition, 
the multiplexer inserts synchronizing bits into the bit stream for the 
purpose of synchronizing the operation of the transmitter and receiv- 
er. 

Table I shows the choice of bands and bit allocations used in the 16- 
kb/s coder. The coder is a 5-band design which was proposed in Ref. 2. 
Column 2 shows the frequency range covered by each sub-band. The bit 
allocation refers to the number of bits/sample used by the coders in each 


2928 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 






APCM 
DECODE 















CHANNEL 


MULTIPLEXER 
DEMULTIPLEXER 






APCM 
DECODE 
N 






Fig. 1—An integer-band sampling implementation of the sub-band coder. 


Table | — 16 kb/s 5-band sub-band coder 


Min 
Band Sampling Step-Size Bit 

Band Edges (Hz) Freq (Hz) (dB) Allocation Kb/s 
1 178-356 356 (Ref) 4 1.42 

2 296-593 593 0 4 2.37 

3 533-1067 1067 0 3 3.20 

4 1067-2133 2133 -3 2 4,27 

5 2133-3200 2133 -8 2 4.27 
SYNC — — 0.47 
16.00 


sub-band. As seen from the table, more accuracy is allowed for encoding 
the lower bands for reasons explained in Ref. 2. 

The frequency range of the coder extends from 200 to 3200 Hz. A plot 
of this frequency response is shown in Fig. 2. As seen in this figure, two 
small notches appear in the frequency response at 1067 and 2133 Hz. 


-10 


{ 
NO 
oO 


LOG MAG IN DECIBELS 
oo 
is) 


| 
4 
oO 





0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 
FREQUENCY IN KILOHERTZ 


Fig. 2—Frequency response of the 5-band coder in Table I. 
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These notches are due to the transition bands of the filters in bands 4 
and 5. Subjectively, these notches are not very perceptible. Bands 1 to 
3 are overlapped to avoid such notches at lower frequencies. The filters 
are sharp cutoff, 200 tap, FIR filters. 

Column 4 in Table I refers to the ratio of minimum allowed step sizes 
of the APCM coders (expressed in decibels), with the minimum step size 
of band 1 being the reference. This choice of minimum step sizes is dif- 
ferent than that suggested in Ref. 2 and was found to give a better 
matching of the dynamic ranges of the sub-bands. 

In Section VI, we propose an alternate design for a 4-band coder which 
can be used in a high channel error environment. 


lil. PERFORMANCE MEASURES 


3.1 Conventional s/n 


Several objective measures were used to evaluate the performance of 
the sub-band coder. In this section we briefly define each of these mea- 
sures. 

The most commonly used measure of performance of digital coders 
has been the conventional signal-to-noise ratio (s/n) evaluated over an 
utterance of speech. The speech power is defined as 


$=) x*(m) (1) 


m 


and the noise power is defined as 
n= (x(m) — y(m))2, (2) 


where x(m) and y(m) are the input and output signals of the coder, re- 
spectively, and the summations in (1) and (2) are taken over the entire 
speech utterance. The conventional s/n is then defined as 


s/n = 10 log($//). (3) 


In measuring the input and output signals to the sub-band coder, it 
is generally desirable to compensate for the effects of filtering in the 
coder, particularly effects of group delay. This is done by the circuit 
arrangement shown in Fig. 3. The input speech signal s(m) is sub- 
band-coded to form the output speech signal y(m). It is also filtered with 
the same filters used in the sub-band coder to generate a compensated 
reference signal x(m) which is used as the input signal in (1) and (2). 
Thus, the s/n ratio defined here is strictly a measure of coder distortions 
and is not affected by bandlimiting or group delay in the coder. 
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Fig. 3—Circuit for evaluating signal-to-noise ratios of the sub-band coder. 
3.2 Segmental s/n 


While the s/n measure is perhaps the most widely used criterion in 
measuring coder distortion, it has also long been known that it does not 
correlate well with subjective performance.2*> Another definition of 
signal-to-noise ratio, however, recently proposed by Noll,®’ does appear 
to correlate better with subjective performance. This measure is based 
on s/n measurements made over short segments of speech which are 
typically about 20 ms in duration. An average over all of the segments 
in the speech utterance is then taken to obtain a composite measure of 
performance for the entire utterance. If (s/n); corresponds to the sig- 
nal-to-noise ratio in decibels for a segment, 1 (computed in the same 
manner as in (3)), the segmental s/n, (SEG), is then defined as 


1N 

SEG = — ¥ (s/n),, (4) 
N i=1 

where it is assumed that there are N 20 ms segments in the speech ut- 

terance. 

Several problems occur in this definition of segmental s/n when re- 
gions of silence exist in the speech utterance. In segments where the input 
signal x(n) is essentially zero, any slight noise will give rise to large 
negative (s/n); ratios, and these segments may unduly dominate the 
average in (4). To prevent this anomaly, we first identify those segments 
which correspond to silence and exclude them from the average in (4). 
This is achieved by means of a simple threshold. Let $; represent the 
speech energy in a segment, 1, so that 


=~ ¥ 2%X(m) (5) 
=— x2(m), 

K m=1 
where K corresponds to the number of speech samples in the segment. 
Then the segment will be included in the computation of SEG in (4) if 
its energy exceeds a threshold o7; that is, if §; > 07. If it does not exceed 
this threshold, it is not included in the average in (4). Furthermore, to 


§; 
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prevent any one segment from dominating the average, we also limit the 
value of (s/n); to a range of —10 to +80 dB. That is, -10 < (s/n); < 80 
dB. 

It remains to determine the threshold o? for the speech/silence deci- 
sion. To establish this threshold, we coded a speech utterance composed 
of three concatonated sentences in which there was about 30 percent 
silence in the entire utterance. Figure 4 shows a plot of SEG as a function 
of o;. The threshold o; was varied from 0 to 32767 corresponding to the 
range of signal values representable in the 16-bit integer word length of 
the computer. The dashed line in Fig. 4 shows the number of segments 
included in the SEG measure. As seen in the figure when the threshold 
o, was below 3, virtually all the silence intervals were included in the SEG 
measure. The low (s/n); in these regions essentially dominated the sum, 
resulting in values of SEG of about 1 dB. When the threshold, o;, was 
raised to a value of 10, nearly all the silence regions were eliminated from 
the measure, and the value of SEG rose to about 9 dB. At a threshold of 
o, = 80, the value of SEG reached a plateau of about 10.3 dB. Therefore, 
the threshold was chosen to be o; = 30 for all SEG measurements. The 
conventional s/n for this same utterance was 10.8 dB. 


3.3 LPC distance measure 


A third performance measure that was used is the LPC distance 
measure proposed by Itakura.®:? This measure is based on an all-pole 
model of speech of the form 


s(n) = > a(m)s(n — m) + Gu(n), (6) 
m=1 


PERCENT SEGMENTS USED 
a 
° 
SEG IN DECIBELS 


30 SEG (dB) 
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Fig. 4—Segmental s/n as a function of the speech/silence threshold o;. 
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where s(n) is the sampled speech signal, a(m)(m = 1,... p) are the 
coefficients of an all-pole filter which models the resonances of the speech 
production mechanism, p is the number of modeled poles, G is the gain 
of the filter, and u(n) is the excitation source for the all-pole filter. 

The LPC distance measure for a segment, K, of speech (typically 20 
ms in duration) is then defined as 


dip = log | 


mand (7) 


b; Vi, bi 
where 


ap, = LPC coefficient vector (1, a1, ...a@,,) measured for the kth frame 
of the original (reference) speech signal s(n), 


b; = LPC coefficient vector measured for the kth frame of the coded 
(or processed) speech is s’(n), 


and Vs, is the speech correlation matrix of s’(n) whose elements V;; are 
defined as 


N-|i-j 
Vay=li=J =" s"nls'(n lis D), (8) 


where s’(n) is the processed speech signal. The overall distance measure 
for the speech utterance is then determined as the average over the N 
segments in the utterance, 


Mz 


~ 1 
dj=—)>dd 9 
1S ay 2 oie (9) 
By interchanging the roles of the reference and processed speech, a 
second distance measure can similarly be defined in the form?° 


1 


b; V, bj 
d= os | Vea | a9 
and 
2) af! ow 
d2= Ni dop. (11) 
An average distance measure can now be defined as 
d=5@,+4,) (12) 


which is the measure used in this paper. The LPC distance measure d is 
basically a measure of dissimilarity between the spectra of the processed 
and unprocessed speech. It is therefore useful in measuring the spectral 
distortion introduced by the coder. If the processed and unprocessed 
utterances are identical, then the distance d is zero. For small differences 
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between the processed and unprocessed signals, d will typically have a 
small positive value less than 1. Large values of d, greater than 1, gen- 
erally indicate significant spectral differences between the processed 
and unprocessed signals. 


IV. DYNAMIC RANGE 


The dynamic range of the sub-band coder is determined by the ratio 
of maximum to minimum allowed step-sizes in the APCM quantizers. In 
this work, we used a ratio of Amax/Amin = 128, which leads to an effective 
dynamic range of about 30 to 35 dB over which the quality remains rel- 
atively constant. Typically, if the Ayax/Amin ratio is increased, the dy- 
namic range of the coder increases (within limits) by about 6 dB per 
doubling of the ratio. 

To improve the performance of the sub-band coder at the low end of 
its dynamic range we also used a mid-rise/mid-tread switch in the APCM 
coders.!! This extended the useful range of the coders by about 6 dB and 
eliminated the low-level tones and idle channel noise generated by the 
APCM coders. 

Figure 5 shows the results of the s/n and the SEG measures for the 
coder for input signal levels over a range of about 50 dB. The measure- 
ments were made for a speech segment composed of two sentences, “High 
altitude jets whiz past screaming” and “A lathe is a big tool,” spoken by 
two different male speakers. As seen in the figure, the conventional s/n 
is high in the granular noise region of the coder (input levels less than 
—10 dB) and drops rapidly in the over-load region (input levels greater 
than 0 dB). It is controlled primarily by the high-energy region in the 
speech utterance. At low input levels, the s/n measure is typically too 
large. It fails to account for the low-level granular noise of the coders, 
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Fig. 5—Dynamic range of the sub-band coder: s/n and SEG measurements. 
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which can be subjectively disturbing. In the overload region, the s/n 
measure overemphasizes the clipping in the high-energy parts of the 
coded speech. 

The SEG measure agrees much better with our informal observations 
of quality. It is more sensitive to the granular noise at low levels and less 
sensitive to the overload in very loud parts of the speech. It essentially 
treats all time segments on an equal basis and does not favor high or low 
parts of the utterance. 

In examining the performance of the sub-band coder, it is also in- 
structive to observe the performance of the individual APCM coders used 
in the sub-bands. Results for s/n and SEG measurements for the 4-, 3-, 
and 2-bit coders are presented in Fig. 6, where the results of the 4-bit 
coder are obtained from measurements of sub-bands 1 and 2, the results 
of the 3-bit coder are obtained from sub-band 3, and results for the 2-bit 
coder are obtained from sub-bands 4 and 5. The solid lines refer to s/n 
measurements, and the dashed lines refer to SEG measurements. An 
important consideration in the design of the sub-band coder is that the 
dynamic range in each of the sub-bands be aligned so that, at the opti- 
mum input level, each sub-band is operating at its peak performance. 
This alignment is determined by the choice of maximum and minimum 
step sizes in the coders in each sub-band. The relative values of minimum 
step sizes (expressed in decibels) that we used are given in column 4 of 
Table IJ, which resulted in the alignment of the dynamic ranges shown 
in Fig. 6. 

Figure 7 shows the results of the LPC distance measurements on the 
sub-band coder. The measure was made between x(m) and y(m) ac- 
cording to the arrangement in Fig. 3 and, therefore, does not take into 
account the spectral distortions due to the filters or notches between the 
bands. At the optimum input level, the value of the LPC distance is 0.12. 
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Fig. 6—Dynamic range of the individual APCM coders in the sub-bands. 
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Fig. 7—LPc distance as a function of the input signal level. 


For low input levels, i.e., in the granular noise region, it goes up to 0.56 
at a —24-dB input level. At high input levels (+24 dB) in the overload 
or clipping region of the coder, the LPC distance goes up to 0.36. Thus, 
the spectral distortion is typically greater in the granular noise region 
than in the overload region of the coder. 

To determine the effect of the bandpass filters and the notches in 
frequency response of the filter, a second LPC distance measurement was 
made across the sub-band coder according.to the arrangement shown 
in Fig. 8. The input speech was delayed by a flat delay equal to the delay 
of the filters. This reference signal and the output of the coder were then 
both filtered with a 200- to 3200-Hz bandpass filter giving the signals 
£(m) and 9(m). The purpose of the bandpass filters on the outputs is 
so that the spectral differences outside of the 200- to 3200-Hz band of 
interest do not affect the LPC distance measure. 

When we measured the LPC distance between £(m) and ¥(m) by this 
method, we obtained a distance of 0.58 for the sub-band coder (operating 
at the optimum input level of 0 dB). We then removed the quantizers 


SUB—BAND 200 — 3200 
CODER BPF 
INPUT 
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s(m) 
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Fig. 8—Circuit arrangement for measuring the total LPC distance of the sub-band coder 
(including the effect of the filters) in a 200- to 3200-Hz bandwidth. 


2936 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


from the coder and measured only the effects of the sub-band filtering. 
This resulted in a distance of 0.53 between X(m) and y(m). This distance 
is strictly due to the passband ripples, sharp transition bands, and 
notches in the frequency response of the coder as seen in Fig. 2. Although 
the contribution to the LPC distance due to the filters was greater than 
that due to quantization noise, their subjective effects cannot necessarily 
be weighted in the same way. Subjectively, the effects of the sharp cutoff 
filters and the notches do not strongly affect the quality or intelligibility 
of the coder. 


V. TANDEM CONNECTIONS 


Computer simulations of tandem connections of sub-band coders were 
made for up to four coders in tandem. Two types of tandem connections 
were considered in this experiment. The first type of tandem link consists 
of a sub-band coder followed by 16-bit linear PCM as shown in Fig. 9a. 
A parallel link of sub-band filters, shown in dotted lines, was also sim- 
ulated in order to generate reference signals to facilitate s/n and SEG 
measurements. 

In the second type of link shown in Fig. 9b, we simulated the effects 
of a digital-to-analog conversion and a resampling of the signals between 
each coder. This simulation was achieved by means of an all-pass filter 
which was inserted between the tandem links. Again, a reference link 
of sub-band filters was also simulated to facilitate signal-to-noise ratio 
measurements. The effect of the all-pass filter is to disperse the phase 
of the coder output so that the succeeding coders cannot synchronize 
their levels from link to link. Figure 10 is a plot of the group delay of the 
all-pass filter that was used. 

Figure 11 shows the results of s/n and SEG measurements for the 
tandem connections as a function of the number of tandem links. The 
solid lines refer to s/n measurements and the dashed lines refer to SEG 
measurements. The upper two curves refer to measurements made on 
the sub-band-to-PCM links in Fig. 9a, and the lower curves refer to 
measurements made on the sub-band-to-analog connections of Fig. 
9b. 

As seen in the figure, for the sub-band-to-analog connections, the s/n 
and SEG measures drop by roughly 3 dB per doubling of the number of 
tandem links, indicating that the quantization noise contributed by each 
link adds independently of other links. 

In the sub-band-to-PCM connection, however, it is seen that the 
quantizer distortions do not add independently. After the first encoding, 
the succeeding coders tend to synchronize their quantizer levels to those 
of the first coder, and in this way they do not add any further distortion 
to the signal. This result is somewhat surprising in view of the fact that 
the quantizers in the sub-bands are separated by interpolating and de- 
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9—Circuits for measuring performance of tandem connections of sub-band coders. 
(a)'s Sub. band/PCM links. (b) Sub-band/analog links. 


cimating filters and all the sub-bands are summed at the outputs of the 
coders between links. In one example, we observed an s/n of 6.8 dB in 
the fourth sub-band of the first link. In the succeeding links, the s/n of 
this same coder went up to 18 dB in the fourth sub-band due to this 
synchronization effect. 

Figure 12 shows the corresponding results for the LPC distance mea- 
surements on the tandem connections. Here again we see that the sub- 
band-to-PCM link performs better than the sub-band-to-analog link. 
A maximum LPC distance of 0.29 was observed for four sub-band-to- 
analog tandem connections, indicating that successive tandem connec- 
tions do not excessively distort the spectrum of the coded speech over 
that of the initial coding. 

Based on informal listening, the quality of two tandem connections 
does not appear to be much different than that of one encoding. For three 
sub-band/analog encodings, the differences become apparent, and with 
four links the differences are clearly noticeable. 
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Fig. 10—Group delay as a function of frequency for the all-pass filters used to simulate 
analog links. 
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Fig. 11—s/n and SEG measurements for the sub-band/PCM and sub-band/analog tandem 
connections. 


Vi. CHANNEL ERRORS 


The analysis of the sub-band coder performance under channel errors 
constituted the largest part of our experimental investigations. The coder 
performance was analyzed for bit error probabilities of up to 10 percent. 
We first analyzed the individual 4-, 3-, and 2-bit APCM coders in each 
of the sub-bands in order to assess their performance separately under 
channel errors. We then examined the use of a robust step-size adaption 
algorithm! in order to enhance the performance of these individual 
coders. For the 4- and 3-bit coders, we also investigated the use of partial 
bit error protection of the sign and most significant bits in the coders. 
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Fig. 12—LPc distance measurements for the tandem connections. 


Based on these results, we then considered three overall sub-band 
coder designs. The first design was the 5-band coder described in Section 
IJ. The second design was the same coder with the robust step-size 
adaption algorithm for its APCM coders. In the third design, we consid- 
ered a 4-band coder with a reduced bandwidth and a slightly lower bit 
rate. The remainding bits were applied to a partial bit error protection 
scheme to enhance its robustness under conditions of very high channel 
errors. We then analyzed and compared the performance of these three 
coders under channel errors. 


6.1 The robust quantizer 


The step-size adaption algorithm used in the sub-band coder is based 
on the one-word step-size memory scheme proposed by Jayant, Flana- 
gan, and Cummiskey.*!° The coder input signal is quantized to one of 
28 levels, where B is the number of bits in the coder. The step-size 
adaption circuit examines the quantizer output bits for the (r — 1)th 
sample and computes the quantizer step-size, A,, for the rth sample 
according to the relation 


A, os A,-1M(L,-1), (18a) 
where 
Amin < Ar & Amax (13b) 


and where A,~ is the step-size used for the (r — 1)th sample. M(L,-1) 
is a multiplication factor whose value depends on the quantizer magni- 
tude level L,-; at time r — 1. It can take on one of 22—! values M1, Mo, 
... Mae-». If the lower-magnitude quantizer levels are used at time r 
— 1,a value of M(L,_,) = M; less than one is used to reduce the next 
step-size. If upper magnitude levels are encountered, a value of M; 
greater than 1 is chosen. In this way, the coder continuously adapts its 
step-size in an attempt to track the short-time variance of the input 
signal. 

A disadvantage of the above adaption scheme is that, once a step-size 
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error occurs, it remains in error until the maximum or mininum step-size 
is reached. A modification of this algorithm, proposed by Goodman and 
Wilkinson!” allows for the step-size computation to be less sensitive to 
past errors. This “robust” algorithm is based on the relation 


A, = (A,—1)?-M(L,;_1), (14a) 
where 
Amin < A, < Amax: (14b) 


The parameter @ is chosen to be slightly less than 1, and it determines 
how rapidly the effects of past errors are dissipated. In the limit when 
B goes to 1, the algorithm reduces to that of (13a). 

As the value of 6 is reduced, the M values must be adjusted to com- 
pensate for its effect on the step-size adaptation. As shown in Ref. 12, 
this compensation can be obtained by a simple scaling of the M values. 
If M; represent the ideal M values for step-size adaption when 6 = 1, then 
the new M values, denoted as M,;, are approximately 


M; = GM; p= 1.2, ...28-h (15) 


where G is a scaling factor that is dependent on 8 and on the expected 
value of A,. In computer simulations, we determined G by optimizing 
the performance of the coders as a function of this scaling factor. Figure 
13 is a plot of G as a function 6 that was used in our simulations. It is 
based on an expected value of A, in the range of 500 to 5000, typically 
encountered in our computer simulations. As seen in the figure, when 
6 varies from 1 down to 15/16 optimum scaling factor, G, increases from 
a value of 1 to about 1.5. 





1. 
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Fig. 13—Multiplier scaling factor, G, as a function of 8, used for computer simulations. 
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6.2 Performance of individual coders under channel errors 


The performance of the individual APCM coders was examined, in 
terms of s/n and SEG measurements, as a function of the bit-error rate 
and the robust quantizer parameter, @. Figures 14a to 14c show the re- 
sults for the s/n measurements for the 4-, 3-, and 2-bit coders, respec- 
tively, as a function of bit-error rate where the bit-error rate corresponds 
to random channel errors. Figures 15a to 15c show similar results for the 
SEG measurements. Four values of 8 were used: 1, 63/64, 31/32, and 
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Fig. 14—s/n performance of the APCM coders as a function of the bit error rate. (a) 4-bit 
coder. (b) 3-bit coder. (c) 2-bit coder. 
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Fig. 15—-SEG performance of the APCM coders as a function of the bit error rate. (a) 4-bit 
coder. (b) 3-bit coder. (c) 2-bit coder. 


15/16. Each value of 8 corresponds to one curve in the plots. The bit error 
rate is plotted on a log scale and covers a range of 104 to 1071. 

Several conclusions can be drawn from the results in Figs. 14 and 15. 
It is seen that the 4-bit coder is the most vulnerable to channel errors 
and that the 2-bit coder is the least vulnerable. Fortunately, the 4-bit 
coder receives the most improvement from the use of a robust step-size 
algorithm. The 2-bit coder, however, receives the smallest improvement 
from the robust algorithm. 
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The robust quantizer does not seem to affect the s/n and SEG mea- 
surements when no channel errors are present until the value of 8 is re- 
duced below a value of about 31/32. This is assuming that the M values 
in the coder are appropriately scaled as discussed in the preceding sec- 
tion. If the M values are not properly scaled, then the performance of 
the coders will be significantly reduced as 6 decreases. For example, the 
performance of the 3-bit coder drops by about 6 dB in s/n and 3 dB in 
the SEG measure when £ is reduced from 1 to 31/32 and the M values are 
not scaled according to (15). 

The optimum choice for the robust quantizer parameter, @, for pro- 
tection against channel errors appears to be about 31/82. 


6.3 Partial bit error correction 


Since the 4- and 3-bit coders are the most vulnerable to channel errors, » 
the lower sub-bands which use these coders are affected the most by 
channel errors. Subjectively, these are also the most important bands 
since distortions in these bands quickly deteriorate the quality of the 
sub-band order. 

One way to maintain the quality in these lower sub-bands is to provide 
for some partial bit-error correction in the transmission of these coder 
bits. In this section, we investigate the effect of sign and/or most sig- 
nificant magnitude bit protection on the performance of the 3- and 4-bit 
APCM coders. 

To provide for error correction of transmitted bits, extra parity bits 
must be transmitted by the coder.!415 The degree of error protection 
that is achieved is strongly dependent on the design of the error pro- 
tection block codes, the bit error rate of the channel, and the percentage 
of additional redundant bits that are transmitted for error protection. 
Fortunately, since the lower sub-bands typically have low sampling rates 
and therefore low transmission rates, the additional transmission rate 
required to provide partial bit-error correction of some of the bits in these 
lower sub-bands should be relatively small compared to the overall 
transmission rate of the coder. In this work, we have avoided issues of 
specific designs of block codes for bit error correction. We have instead 
assumed that ideal or nearly ideal error protection can be achieved. The 
results that we present should therefore be interpreted as upper bounds 
on what can be achieved, given a sufficient amount of extra transmission 
rate for error protection. 

Figure 16a and 16b show results of s/n and SEG measurements on the 
4-bit APCM coder, as a function of the bit error rate, for several bit error 
protection schemes. In all the results, a robust quantizer with 8 = 31/32 
is used. The solid line shows the performance when no bit-error correc- 
tion is used. The long dashed curve shows the results when the sign bit 
is ideally protected, and the short dashed line shows the results when 


2944 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


SIGN & MOST SIG MAG 
“ BIT PROTECTION 
__ —— MOST SIG MAG 
BIT PROTECTION 


NOERROR  _ 
PROTECTION — 


SIGN— BIT 
PROTECTION —— 


s/n IN DECIBELS 





_— SIGN & MOST SIG MAG 
we BIT PROTECTION 


‘a 
=> Ta. MOST SIG MAG 


= -~ BIT PROTECTION 
14 = age < 
NO ERROR fF OCS ES 
10 PROTECTION — ~ \ \ ~~ 
‘\ 


S SIGN-BIT _-~ \ “< 
PROTECTION ~ 


SEG IN DECIBELS 
'e, 
> 
74 
ws 


(b) \ 
_ 10-3 10~2 10-1 
BIT ERROR RATE 


Fig. 16—Performance of the 4-bit APCM coder with partial error protection. (a) s/n 
measurements. (b) SEG measurements. 


the most significant magnitude bit is ideally protected. Finally, the long 
and short dashed curve shows the performance when both the sign bit 
and the most significant magnitude bit are protected. As seen in the 
figure, protection of the most significant magnitude bit alone gives a 
better performance than when the sign bit is protected alone. This occurs 
because an error in the sign bit results in a single isolated error, whereas 
an error in the most significant magnitude bit causes a step-size error 
which propagates for many samples. At high bit-error rates, significant 
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improvements in coder performance are possible with bit-error protec- 
tion. 

Figure 17 shows similar results for the 3-bit APCM coder. In this case, 
the protection of only one bit was considered, either the sign bit (long 
dashed line) or the most significant magnitude bit (short dashed line). 
It is seen that protecting the sign bit leads to about the same improve- 
ment as the most significant magnitude bit. In comparing Figs. 16 and 
17, it can be seen that the improvement of the 3-bit coder performance 
with error protection in high channel errors is not as large as the im- 
provement obtained for the 4-bit coder. 


6.4 A sub-band coder design for high channel errors 


As noted in the previous section, when high channel errors are en- 
countered, it is possible to divert a part of the transmission rate to the 
protection of bits in the lower sub-band(s). In this way, some of the coder 
quality at low channel error rates can be traded for more robustness of 
the coder at high channel error rates. In this section, we consider an ex- 
ample of such a design. 

Table II shows the choice of bands and bit allocations for a 4-band 
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Fig. 17—Performance of the 3-bit APCM coder with partial error protection. (a) s/n 
measurements. (b) SEG measurements. 
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Table Il — 16 kb/s 4-band coder with partial bit error correction 


Min 

Band Sampling Step-Size Bit 
Band Edges (Hz) Freq (Hz) (dB) Allocation Kb/s 
1 250-500 500 (Ref) 4 2.0 
2 500-1000 1000 -19 3 3.0 
3 1000-2000 2000 —6 2 4.0 
4 2000-3000 2000 —10 2 4.0 
SYNC AND ERROR CORRECTION Tar 
16.0 


16-kb/s coder with partial bit-error correction in the lowest band. The 
frequency response of this coder is shown in Fig. 18. In comparison to 
the 5-band coder, it is seen that this coder has a narrower overall band- 
width and an additional notch in its frequency response. Thus, the 
quality of this coder tends to be more reverberant than that of the 5-band 
design. The LPC distance measure for this coder, measured according 
to Fig. 8, was 0.82 compared to 0.58 for the 5-band coder. When the 
sub-band filters alone were measured, a distance of 0.69 was observed 
compared to 0.53 for the 5-band coder. 

In trade for this reduced quality, the 4-band coder has 3 kb/s of rem- 
aining transmission rate or 18.75 percent of its total transmission rate 
which can be used for bit error protection. This is applied to the pro- 
tection of the sign and most-significant magnitude bits of the 4-bit coder 
in the first sub-band. 


LOG MAG IN DECIBELS 





0) 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 
FREQUENCY IN KILOHERTZ 


Fig. 18—Frequency response of the 4-band coder in Table II. 
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6.5 Overall performance of the sub-band coders with channel errors 

In this section, we present the results of computer simulations of three 
different sub-band coder designs under conditions of channel errors. The 
simulations were made with random channel errors with error rates of 
up to 10—!. The first coder, coder A, is the 5-band coder in Table I with 
no robust quantizer (i.e., 8 = 1). Coder B is the same 5-band design with 
a robust quantizer with 8 = 31/32. Coder C is the 4-band design, in Table 
II, for high channel errors. It has a robust quantizer, 8 = 31/32, and as- 
sumes ideal error protection of the sign and most-significant bit in its 
first sub-band (the 4-bit APCM coder). 

Figure 19 shows the results of the s/n and SEG measurements for the 
three coders as a function of the bit error rate. Coders A and B, the 5- 
band designs, have the best performance and quality at very low error 
rates. The use of the robust quantizer does not significantly reduce the 
performance of Coder B (assuming the M values are scaled properly) 
at low error rates. The 4-band design has a somewhat lower quality at 
low error rates due to its reduced bandwidth and lower effective trans- 
mission rate. 

As the bit error rate increases, the performance of the unprotected 
coder, coder A, drops rapidly. Channel error distortions are noticeable 
at error rates of 2 X 107. At error rates of 5 X 10-3 and 10-2 the quality 
drops rapidly and at error rates of 2 X 10-2 the coder is essentially un- 
intelligible. 

The use of the robust quantizer significantly improves the perfor- 
mance of the 5-band coder for moderate error rates. Coder B has no- 
ticeable degradations in quality at bit-error rates of about 10~2. At error 
rates of 2 X 10-2, this quality degrades rapidly and at error rates of 5 X 
10-2 the coder starts to become unintelligible. 

The 4-band coder, coder C, holds up well for error rates up to about 
2 X 107? before the effect of channel errors becomes noticeable. Its 
quality, however, is slightly lower to begin with. At error rates of 10~!, 
the quality degrades sharply although the coder still appears to be quite 
intelligible. 

Figure 20 shows the results of the LPC distance measure on the three 
coders. These results do not appear to agree well with s/n and SEG 
measures nor do they agree well with our informal subjective observa- 
tions. For example, at bit error rates of 10-2 the LPC distance of coder 
A is 0.35, indicating that the coder should have reasonably good quality. 
In fact, the subjective quality of the coder at this point was significantly 
degraded. Also, the LPC distance failed to sufficiently distinguish the 
differences in quality between coders B and C at high error rates. 

To investigate this problem in more detail, we plotted the individual 
segmental LPC distances dj, and d», defined in (7) and (10) as a function 
of time (measured in segments). Figure 21a shows these results for coder 
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Fig. 19—Performance of the three sub-band coder designs under channel errors. (a) 
s/n measurements. (b) SEG measurements. 


A at a bit error rate of 10~? for two concatenated sentences. From this 
plot, it becomes clear as to what is happening. On the average, the coder 
performance is quite good. However, in about 10 or 12 isolated segments, 
severe distortions were observed where channel errors occurred in lower 
sub-bands. Because of these isolated errors, the entire sentence sounds 
poor in quality. Figure 21b shows similar results for coder A at error rates 
of 5 X 10-2. Again, it is seen that there are numerous segments in which 
the distortions are intolerable; however, on the average, the distortion 
was not that bad; 1.e., it was below 1. Subjectively, the presence of these 
large errors made the sentence virtually unintelligible. 


16 kb/s SUB-BAND CODER PERFORMANCE 2949 


CODER B 
(B = 31/32) ~ _ 


CODER C 
(8 = 31/32 AND BIT 
ERROR PROTECTION) 


LPC DISTANCE 


CODERA 





0 1073 10-2 10-1 
BIT ERROR RATE 


Fig. 20—LPc distance as a function of bit-error rate for the three sub-band coder 
designs. 


From these observations, we conclude that the LPC distance, used 
properly, is in fact a good indicator of quality. However, when an overall 
measure of quality for an utterance is required, something more so- 
phisticated than a simple mean of the segmental distances must be used. 
This is particularly important in the case of channel errors. 


Vii. CONCLUSIONS 


In summary, a number of general conclusions can be drawn from the 
results of this work. 

(1) When the maximum-to-minimum step-size ratios of the APCM 
coders is 128 and the dynamic range of sub-bands are properly aligned, 
the quality of the coder remains relatively constant over a range of input 
levels of about 30 dB. This range increases by about 6 dB per doubling 
of this step-size ratio. The idle channel noise performance of the coder 
can be improved by the use of a mid-rise/mid-tread switch on the 
quantizers in the APCM coders. 

(ti) For tandem connections of sub-band coders with conversion to 
analog format between links, the signal-to-noise ratio drops by roughly 
3 dB per doubling of the number of tandem coders. When linear phase 
FIR filters are used in the coders and they are connected by PCM links, 
the step sizes of the coders tend to synchronize, and the performance of 
the tandem connection improves. 
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Fig. 21—LPCc distance as a function of time for (a) coder A at a bit error rate of 10-2 and 
(b) coder A at a bit error rate of 5 X 10-2 (note a difference in scale). 


(wi) The effects of channel errors in an unprotected sub-band coder 
are first observed at bit error rates of about 2 X 1078. At error rates of 
2 X 10-2, the quality of the coder is essentially unintelligible. When a 
robust quantizer algorithm is used, errors are first noticeable at bit error 
rates of about 10-2, and at error rates above 5 X 10-2 the coder becomes 
unintelligible. When both a robust quantizer and partial bit-error pro- 
tection is used in the lower sub-band(s), the effect of channel errors is 
not significant until error rates of about 2 X 10~? are reached and the 
coder appears to be intelligible at error rates as high as 10~!. The above 
results are based on the assumption that sufficient protection is provided 
for the synchronization and parity bits so that no loss of synchronization 
occurs between high channel errors and that nearly ideal error protection 
is possible for the coder bits which are protected in the partial bit-error 
protection scheme. 
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A Mid-Rise/Mid-Tread Quantizer Switch for 
Improved Idle-Channel Performance in Adaptive 
Coders 


By R. E. CROCHIERE 
(Manuscript received November 17, 1977) 


A mid-rise/mid-tread switch is proposed for improving the idle 
channel performance of an adaptive quantizer. The method incorpo- 
rates the advantages of both mid-rise and mid-tread quantizer char- 
acteristics. 


In adaptive waveform coding such as ADPCM (adaptive differential 
PCM), ADM (adaptive delta modulation),!? and sub-band coding,?4 the 
quantizer step-size in the coder varies in accordance with the short-time 
energy of the signal being coded in order to take advantage of its non- 
stationary properties. In practice, these types of coders generally have 
minimum and maximum limits on their step-size. Furthermore, they 
often use a mid-rise quantizer characteristic as depicted in Fig. 1 where 
x denotes the input signal level, ¢ denotes the discrete output signal 
levels, and A denotes the quantizer step-size. This mid-rise characteristic 
is desirable because of its symmetry and because it uses the 28 possible 
levels of a B-bit coder efficiently. 

A disadvantage of this mid-rise characteristic is that it cannot repre- 
sent a zero output level. During very low, or zero, input signal intervals 
(such as silent regions in speech), the output of the coder must be +Anin, 
where Anin is the minimum step-size in the coder. Generally, Amin is 
chosen to be small enough so that this signal is very low. Unfortunately, 
in many coder designs the sign of the output signal varies in a systematic 
pattern which can be perceived even for very low values of Ayin. 

For example, in ADPCM with a zero input level, the coder output level 
can oscillate between +Amin, creating (in a speech coder) a low level tone 
at half the sampling rate. Because of the sensitivity of our hearing 
mechanisms to tones, this tone can be perceived even at very low levels 
(i.e., very small A,ins). In sub-band coders, this problem is compounded 
further by the fact that sub-bands of speech are lowpass-translated to 
dc, encoded and decoded, and then bandpass-translated back to their 
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Fig. 1—Mid-rise quantizer characteristic. 





Fig. 2—Mid-tread quantizer characteristic. 


respective bands. This modulation process can translate such small tones 
directly into the middle of the speech band where they are particularly 
noticeable. Even a dc level of + Amin or —Amin Will appear as a low-level 
tone when modulated to the middle of the speech band. Furthermore, 
decaying exponentials, in which the quantizer systematically uses its 
lowest level as its step-size decays to Ain, Will appear as decaying tones 
in this situation. 

One way to alleviate the above situation is to use a mid-tread quantizer 
characteristic as shown in Fig. 2. Unfortunately, this characteristic has 
an odd number of levels (if it is symmetric) or it must be nonsymmetric 
about zero. Therefore, it does not use the 2? possible levels of a B-bit 
quantizer efficiently. 

For adaptive quantizers, fortunately, there is a solution to the above 
problem. Since the quantizer step-size A varies with the short-time en- 
ergy of the signal being coded, it also tells us when the input signal level 
is near zero. We propose a mid-rise to mid-tread switch on the decoder 
quantizer output which occurs when the step-size A falls below some 
threshold A;,. That is, when A < A;y, the two lowest levels of the 
quantizer in the decoder are switched to zero to give a mid-tread char- 
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Fig. 3—(a) Output of a sub-band coder with mid-rise quantizers as the step-size is 
decaying. (b) The same output using the mid-rise/mid-tread switch. 


acteristic as in Fig. 2. When A > Ajp, the characteristic in Fig. 1 is used 
because of its more efficient use of levels. Typically, a practical choice 
of Ay, is about 1.5 Amin to 3 Amin. 

The above mid-rise/mid-tread switch has been used successfully in 
a sub-band coder. It greatly improved the performance of the sub-band 
coder when it was driven at the low end of its dynamic range. The slight 
amount of center-clipping introduced by the mid-tread characteristic 
(at very low input levels) was found to be greatly preferred to the low- 
level tones and idle channel noise of the mid-rise quantizer characteristic. 
Figs. 3a and 3b show the results of a sub-band coder output, as the 
quantizer step-sizes are decaying, for coders with and without the mid- 
rise/mid-tread switch. As seen in the figure, the low level tones and the 
coder noise are completely eliminated by the mid-rise/mid-tread 
switch. 
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Zone-Balanced Networks and Block Designs 


By F. R. K. CHUNG 


(Manuscript received November 30, 1977) 


A switching network can be viewed as a collection of interconnected 
crosspoints that provides connection between input terminals and 
output terminals. Many networks have the property that the set of 
input terminals and output terminals can be partitioned into a number 
of zones such that the requests for connection between an input ter- 
minal and an output terminal in the same zone are more likely than 
those connecting terminals in different zones. In this paper, we study 
the structure of switching networks of this type by the use of block 
designs. 


Il. INTRODUCTION 


We shall consider multistage switching networks composed of rec- 
tangular switches. For an input terminal u and an output terminal uv’, 
the channel graph for u and u’, denoted by G(u,u’), is defined to be the 
union of all paths that can be used to connect u and uw’. (A channel graph 
is also called a linear graph.) A network is said to be balanced if all 
channel graphs G(u,u’), where u is in the set J of input terminals and 
u’ is in the set 2 of output terminals, are isomorphic.!:? A network is said 
to be zone-balanced if it has two nonisomorphic channel graphs, say G1 
and Gg, so that the channel graph G(u,u’) is isomorphic to G, if u and 
u’ are in the same zone, and G(u,u’) is isomorphic to Gg if u and u’ are 
in different zones. (See Fig. 1. Note that the switches in the network can 
be viewed as nodes of the corresponding graph.) G, is called the internal 
graph and Gg is called the external graph of the switching network. 

A zone-balanced network usually consists of three parts. The primary 
part consists of a few stages where traffic distribution takes place within 
each zone so that each input terminal has sufficient access to the central 
stage. The secondary part is the central stage which provides intercon- 
nections between different zones. The tertiary part plays the same role 
for the output terminals as the primary part does for the input termi- 
nals. 
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Fig. 1—(a) A zone-balanced network. (b) The internal graph. (c) The external graph. 


The primary part is, in fact, composed of a number of distribution 
networks, each of which provides traffic distribution within each zone. 
We first investigate distribution networks in Section II. In Section III, 
we introduce the concept of block designs, which will then be used for 
connecting the distribution networks in the primary part and the 
switches in the central stage (Sections IV and V). In Section VI, we study 
zone-balanced networks of more general types. 


ll. DISTRIBUTION NETWORKS 


Figure 2a illustrates an example of a distribution network. The dis- 
tribution graph for an input terminal u is defined to be the union of all 
paths containing u. In a distribution network, the distribution graphs 
for any two input terminals are isomorphic. The distribution graph of 
the distribution network in 2a is shown in 2b. The labeling function f 
is explained later in this paper. 

We consider a distribution network M,, which is an s-stage network 
with switches in stage i having size n; X m; for 1 <1 < s. The distribution 
graph is then as shown in Fig. 3. 

The switch sizes (n; X m;, 1 <i < s) and the number s of stages are 
generally dependent upon the traffic loads and the number of input 
terminals in each zone in order to reduce the “cost” (the number of 
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Fig. 2—(a).A distribution network. (b) The distribution graph. 
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Fig. 3—A general distribution graph. 


crosspoints) of the network.?:+ Here we will only study the structure of 
distribution networks and will not be concerned with the complicated 
problem of determining s and n;,m;,1 <i Ss. 

A distribution network can usually be constructed recursively. Figure 
4 illustrates a complete distribution network in which the number of inlet 
lines is the product of n;, 1 <1 < s, and the number of outlet lines is the 
product of m;. We note that, in a complete distribution network, we have 
p = p’ in Fig. 4 and exactly one link exists between a copy of M;-1 and 
a switch in the last stage of M; for 1 <i Ss. 

Here is an explicit method for the interconnection in the complete 
distribution network M,. While the notation may appear at first to be 
somewhat complicated, it will turn out to be very useful and precise in 
specifying the link connections of the network. 
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Fig. 4—A complete distribution network. 


Let fi (ir,ia,. hie sls—1) denote the (ty + Lom, fcatk ls—1M. SC NtesoP 
1)th switch in stage j where 0 < lg Sm ,1sqss-1. 
. Let fa (1,02,. .. ,ls—1) denote the (gq + 1)th outlet line of the switch 
fi (i1,... ,ts-1) where 0 <q < mj. Then we have: 


fii, . .«,ls—1) is connected to 
pis 2 Late; tee sea) 


forallj,1<j<s. 

Sometimes the values of n;, m;, 1 < i < s, do not allow complete con- 
nections between consecutive stages, i.e., p < p’ and the number of inlet 
lines of the zone is not equal to the product of n;, 1 <i < s (see Fig. 5). 
These distribution networks are said to be incomplete distribution 
networks. In this case, an appropriate connection, according to some 
rules to assign links cyclically, usually can result in a distribution network 
of this type (called cyclic distribution network). Here we give a scheme 
of constructing a cyclic distribution network in which copies of M;-; and 


Fig. 5—An incomplete distribution network. 
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the switches of the last stage of M; are connected according to the con- 
nection pattern of a regular bipartite graph as follows. 

Consider a bipartite graph with sets of nodes A and B. A link connects 

anode in A toa node in B. A bipartite graph is said to be regular if the 
degree of each node in A is equal to some integer x and the degree of each 
node in B is equal to some integer y. It then follows that ax = by where 
a = |A|, b= |B|. We restrict ourselves to the case that a = y, b => x. We 
will show that the condition ax = by is sufficient for the existence of such 
a regular bipartite graph. We consider the following two possibilities. 
Case 1: a and y are relatively prime. In this case, y divides x. Let 
Qy,...,Qq denote nodes in A and j,. .. ,8, denote nodes in B. We will 
then connect a; to 6; where j =i x/y + k (mod b) where 1 <k <x, and 
1 <j <b. It is easy to see that the resulting graph is a regular bipartite 
graph (see Fig. 6a). 
Case 2: Let d be the greatest common divisor of a and y. Then we can 
construct, by Case 1, a regular bipartite graph G’ on sets of nodes A’ and 
B’ where |A’| = a’ = a/d and |B’| = b. The degrees of nodes in A’ are all 
equal to x’ = x and the degrees of nodes in B’ are all equal to y’ = y/d. 
Now, we construct the regular bipartite graph on A and B as follows: The 
set A can be viewed as d copies of A’. The connection between each copy 
of A’ and B is the same as G’. It is easily verified that the resulting graph 
is regular and a node in A or B has degree x or y, respectively (see Fig. 
6b). 

Now we can construct the incomplete distribution network M; by 
connecting a copies of M;-; and b copies of n; X m; switches if ap = bn; 
(p is the number of outlet lines of M;-,), according to the regular 
bipartite graph we described above by taking each copy of M;_1 as a node 
in A and each switch n; X m; as a node in B. 





(a) (b) 
Fig. 6—Regular bipartite graphs. 
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We note that there are many nonisomorphic regular bipartite graphs 
of the given sizes. Therefore, there are many different incomplete dis- 
tribution networks of the same “cost” (number of crosspoints). In Section 
IV and V we see some incomplete distribution networks obtained by 
some methods other than the cyclic connections described here. Those 
methods are based on a basic concept in combinatorics called a block 
design, which we next describe. 


ill. BLOCK DESIGNS 
A (v,6,r,k,A) block design is a family of subsets X 1,X9,...,X, of a 
v-element set X, satisfying the following conditions: 
(i) Each X; has k elements, 1 <i < b. 
(11) Each 2-element subset of X is a subset of exactly \ > 0 of the sets 
>. Crema, CF 
Properties (iit) and (tv) follow immediately from (Z) and (iz). 
(uit) Each element of X is in exactly r of the sets X1,...,Xp. 
(iv) r(k — 1) = X(v — 1) and bk = ur. 
For example, the following is a (7,7,3,3,1) block design. 


X; = {ii + 1,1 + 3} (mod 7) for 1 <i <7. 


The reader is referred to Refs. 5 and 6 for the existence and con- 
struction of various classes of block designs. 


IV. THREE-STAGE ZONE-BALANCED NETWORKS 

Let X1,...,Xp, be a (v,b,7,k,A) block design. We will construct a 
three-stage zone-balanced network having internal graphs containing 
r paths and external graphs containing \ paths (see Fig. 7a and b), and 
having uv switches in the first or third stage. The input terminals of the 
same zone go to the same switch. Let y1,... ,y,) be switches in the first 


(b) 


oO 


(c) 


Fig. 7—(a) A zone-balanced network constructed by using (7,7,3,3,1) block design. 
(b) The internal graph. (c) The external graph. 
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stage (the primary part), each of which has r output lines. Let z,... ,z, 
be switches of size k X k in the second stage (the secondary part). Then 
we connect y; to z; if and only if i is an element of X; (see Fig. 7 for the 
(7,7,3,3,1) block design). The connection between the second and the 
third stage is a mirror image of that between the first and the second 
stage. It is easily verified that the resulting network is zone-balanced and 
has internal and external graphs as shown in Fig. 7b and c, respectively. 
This is the basic model of zone-balanced networks. 


V. MULTISTAGE ZONE-BALANCED NETWORKS 


In this section, we give explicit constructions for various types of 
multistage zone-balanced networks. Roughly speaking, we construct 
these multistage zone-balanced networks by replacing each switch in 
the first or the last stage of the three-stage zone-balanced network de- 
scribed in Section III by a distribution network. The internal graphs in 
these zone-balanced networks depend on the switch sizes in the distri- 
bution networks. Suppose a distribution network has s stages and has 
switch of size n; X m; in stagez, 1 <1 <s. Then the internal graph in the 
zone-balanced network has mjmp. .. ms paths as shown in Fig. 8a. The 
external graph depends heavily upon the linking pattern between the 
distribution networks in the primary part (or tertiary part) and the 
switches in the central stage. In general, it would be desirable to have 
the external graph as “spread-out” as possible. We will not define 
“spread-out” rigorously here. For example, the graph in Fig. 10c is more 
spread-out than the graph in Fig. 1c, although they contain the same 
number of paths. It can be shown?’ that the more spread-out the channel 
graph is, the less the traffic congestion will be. The external graph in the 
zone-balanced network we construct will be as shown in Fig. 8b. 

In this section, we restrict ourselves to the case in which the number 
of input terminals is the same as the number of output terminals and 
the zone sizes are equal, ie., |Z] = |Q|], 2 =Iqu...uly, Q= Qyu...UQy 


Se : 


(a) (c) 


we oa A general multistage internal graph. (b) A general multistage external graph. 
c) A path. 
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and |J;| = |9;|, 1 <ij <v. An input terminal u, uel;, and an output 
terminal uv’, u’eQ;, are said to be in the same zone if and only if i = j. 

We first present a simple technique, called a Type 1 construction, for 
construcing zone-balanced networks. The distribution network M, in 
the primary part has the property that every switch in the last stage has 
r output lines, i.e., m; = r. Let us assume M, has g switches in the last 
stage. Thus, M, has gr output lines. We will construct a zone-balanced 
network with internal graphs containing gr paths and external graphs 
containing qd) paths (see Fig. 9a and b) as follows. 

Let the central stage consist of bq switches of size k X k. Then the q’th 
switch of the v’th copy of M, in the primary part is connected to the 
((b’ — 1)q + q’)th switch in the central stage for any b’ with.v’eX,’. We 
connect the central part and the tertiary part in the same way (sym- 
metrically) that the primary and central parts are connected. 

For example, using the (3,3,2,2,1) block design X = {1,3}, Xo = {1,2}, 
X3 = {2,3}, we obtain the network shown in Fig. 10. We note that the 
networks in Fig. la and Fig. 10a have the same number of crosspoints, 
but the channel graph in Fig. 10c is more spread-out than the channel 
graphs containing r paths, provided a (v,b,r,k,A) block design exists 
in Fig. la, though their ‘“‘cost”’ (number of crosspoints) are the same. 

Now we give a construction (Type 2) of another class of zone-balanced 
networks which has external graphs containing a path and internal 
graphs containing r paths, provided a (v,},r,k,A) block design exists 
where A = 1. The primary part consists of v copies of a complete distri- 
bution network which has r output lines. Thus, r is the product 
mM,mMy...ms;. The central stage consists of b switches of size k X k. We 
will connect the r output lines of the distribution network to the switches 
in the central stage as follows: One of the output lines of the v’th copy 
of the distribution network will be connected to the b’th switch in the 
central stage for any b’ such that v’ is contained in the block X,’. In Fig. 
11, we have an example which is constructed by using a (9,12,4,3,1) block 
design. It can be easily verified that the internal graph is as shown in Fig. 
8a and the external graph is as shown in Fig. 8c. 





m2 


my 





(a) (b) 
Fig. 9—(a) An internal graph. (b) An external graph. 
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(b) 





(a) (c) 


Fig. 10—(a) A type-1 zone-balanced network constructed by using (3,3,2,2,1) block 
design. (b) The internal graph. (c) The external graph. 





: Fig. 11—A type-2 zone-balanced network constructed by using (9,12,4,3,1) block 
esign. 


Type 3 zone-balanced networks are variations of Type 2 zone-balanced 
networks. The primary part consists of copies of a distribution network 
which is not necessarily a complete distribution network. In Figure 12a, 
we have an example which is constructed by using (3,3,2,2,1) design. Its 
internal and external graphs are shown in Figs. 12b and c, respectively. 
The distribution network usually has rw output lines for some integer 
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(b) 


—— e.g 


(c) 


Fig. 12—(a) A type-3 zone-balanced network constructed by using (3,3,2,2,1) block 
design. (b) The internal graph. (c) The external graph. 


w. The central stage consists of bw switches which can be partitioned 
into b subsets, each of which has w switches. Among the rw output lines 
of the v’th copy of the distribution network, there are w output lines 
which are connected to the w switches, respectively, in the b’th subset 
if X,’ contains v’ and any input line of the distribution network does not 
have access to more than one output line of these w output lines. The 
example in Fig. 12 more or less reveals the scheme for constructing 
zone-balanced networks of this type. The detail of the construction is 
omitted. 

Type 4 zone-balanced networks are generalizations of Type 2 zone- 
balanced networks. A Type 4 zone-balanced networks has external graph 
containing \w paths and internal graphs containing rw paths, provided 
a (v,b,r,k,A) block design exists. The primary part consists of v copies 
of a complete distribution network which has s stage and has switches 
of size n; X m; in ith stage. Thus, rw is the product m,mp, .. m;. Since 
we want the external graph as spread-out as possible, we will restrict 
ourselves to use block designs with \ = 1. (By using block design with 
\ > 1, the subsequent scheme for constructing Type 4 zone-balanced 
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network can be applied with some modification. However, the resultant 
networks will usually have internal graphs which are not as spread-out 
as possible.) The internal graph and the external graph are as shown in 
Fig. 8b. Thus, we will choose w in the form mm». .. mg_;d where d di- 
vides mg. The connection between the primary part and the central part 
is more complicated than that in the Type 2 construction. However, it 
can be explicitly specified by the following simple method. 

Let f5(t1, . . . ts—130”) denote the (q + 1)th outlet lines of the (7; + igm, 
+... + ls-ymyz...Ms—2 + 1)th switch in stage s of the v’th copy of 
Ms. 

The central stage of the zone-balanced network consists of bw switches 
of sizek X k. We define 


f¢(i1, ... ,1g3b’) to be the 
(ty + igmy +... +gmy...Mg-1 + (b’— 1) my... Mg-1d + 1) thswitch, 


where 
O<ig<m,1Sq<a0 Sig <dand0<b’ <b. 
Let X1,X9,...,X, denote the sets of a (v,b,r,k, A) block design with 
\ = 1. For any element yeX = u;X;, we say the ith y-set is X; if X; is the 
ith set containing y, ie., [{XgiveXg,1 Sq Sj} =i. 
First, we consider the special case when d = m,. 
f§(i1, ...,ts—150’) is connected to 
Fe (ir, ce sigiD") 
if v’eX,’ and the 
ot FlapomMasri +. 2. isn iMate ss Meno 
+ gMq41.-- Ms—1 + 1)th v’-set 
is Xp’. 
Now, if d is a proper divisor of mg, the above scheme has to be modi- 


fied slightly. Note that i, can be written asi, +i,d where 0 <i, <d. 
Then we have: 


fo (1, . .- ,ts—1j0’) is connected to 
fe (,. os jla=ttai0’) 
if v’eX,’ and the 


(i, ae lg+1d Gee ls—1Mq41 --- Ms-92 
+ qdmg+1...Ms—1 + 1)th v’-set 


is X 4. 
We note that the fj (or f°) is just a digital expression for address as- 
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signments of the output lines in stage s. The last s — a digits (i.e., 
lg, - + sls—1 Of f5(11, . . . ,ts)) are used to find the location of the “block” 
of switches in the central stage and the first a digits are used to specify 
the location of the switch in that block to which the output lines of 
f*(i1, ... ,ts—1) should be connected. Figure 13a gives an example using 
the (13,13,4,4,1) design where X, = {i, + 1,1 + 3,7 + 9} (mod 18). Its in- 
ternal and external graphs are also shown in Fig. 13b and c. 

We note that, by modifying the construction mentioned above, we 
could easily obtain zone-balanced networks having external graphs not 
necessarily as spread-out as possible. For example, let us modify the 
definition of f§, the digital expression for address assignments of output 
lines of the switches in stage s. Instead of using the last a digits, we use 
the first a digits to assign the location of the “‘block’’ of switches in the 
central stage and use the last s — a digits to specify the location of the 
switch in the “block” to which the output line should be connected. The 
resultant zone-balanced network then has the external graph which is 
the least spread-out channel graph among all possible graphs having the 
same number of paths. In general, we can use arbitrary x digits to specify 
the location of the block (as long as the necessary condition on divisibility 
_is satisfied) to obtain zone-balanced networks having various external 
graphs. 

We also note that it is possible to derive a generalized version of Type 
3 zone-balanced networks using incomplete distribution networks. 
However, it is more complicated and has more constraints than Type 
4 construction. We shall not discuss it here. 


Vi. ZONE-BALANCED NETWORKS OF MORE GENERAL TYPES 


We note that the right half of a zone-balanced network is itself an 
incomplete distribution network. These incomplete distribution net- 
works, called BD-distribution networks, seem to distribute traffic more 
evenly and have richer combinatorial properties than the cyclic incom- 
plete distribution networks described in Section II. One of the obvious 
reasons is that a BD-distribution network together with its mirror image 
gives a zone-balanced network, whereas a cyclic incomplete distribution 
network does not. 

Suppose the set of input and output terminals can be partitioned into 
a number of zones which can themselves then be partitioned into several 
areas such that requests for connecting terminals in the same area (zone) 
are more likely than those for connecting terminals in different areas 
(zones). In such a network, there usually are four channel graphs, say 
G1, Go, G3, G4, such that G(u,u’) is isomorphic to Gy, if u and uv’ are in 
the same zone and area; G(u,u’) is isomorphic to G, if u and u’ are in 
different zones and areas; G(u,u’) is isomorphic to Gg if u and uv’ are in 
the same zone but different areas; G(u,u’) is isomorphic to G3 if u and 
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(c) 


(b) 
Fig. 13—(a) A type-4 zone-balanced network constructed by using (13,13,4,4,1) block 
design. (b) The internal graph. (c) The external graph. 


u’ are in the same area but different zones. These networks will be called 
multizone-balanced networks, and can be built by schemes similar to 
those described in Section V. A multizone-balanced network can also 
be viewed as a combination of three parts. However, the primary part 
usually consists of copies of a BD-distribution network. ° 
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be 
. 


(a) (b) 


be 
be 


(c) (d) 


Fig. 14—Some channel graphs for multizone-balanced networks. 


_A simple model of a five-stage multizone-balanced network can be 
described as follows 

Let X1,...,Xp, be a (U1,61,r1,k1,A1) block design and Yj,...,Y5, be 
a (V9,b9,re,k2,A2) block design. The distribution network M, for each 
zone is exactly the right half of the three-stage zone-balanced network 
described in Section IV, and the number of output lines of a switch in 
the last stage of M, is ro. The primary part consists of v2 copies of M,, 
and the central part consists of switches of size ky X Ro. The ith switch 
of the jth copy of M, is connected to the ((b’ — 1)b; + 1)th switch if 7 is 
an element of Y;,’. Again, the tertiary part is connected to the central 
part in the same way (symmetrically) that the primary and central parts 
are connected. 

It is easy to verify that the channel graphs G1,Go,G3,G4 are as shown 
in Fig. 14. 

We could modify the above multizone-balanced network by replacing 
the switches in the primary part or the tertiary part by distribution 
networks using the same method used in Section V. However, we will 
not discuss this here. . 

Suppose the number of input terminals and the number of output 
terminals are not equal, say |Q| is a multiple of |J|. We can still construct 
zone-balanced networks by combining copies of zone-balanced networks 
which have the same number of input and output terminals. 

Hagelbarger? first proposed the use of block designs for constructing 
switching networks in 1973. Some other techniques for constructing 
various classes of networks have been investigated in Refs. 10 and 11. 
Hopefully, more structures in combinatorics and graph theory can be 
employed as useful techniques to construct interesting classes of 
switching networks. 
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The Construction for Symmetrical 
Zone-Balanced Networks 


By F. K. HWANG and T. C. LIANG 
(Manuscript received January 27, 1978) 


For many real networks, the input and output switches can often 
be partitioned into subsets called zones such that switches in the same 
zone have a certain community of interest and are more likely to request 
a connection. Such a network will be called a zone-balanced network 
if, among other regularity conditions, the channel graph between an 
input switch u and an output switch v is either isomorphic to a graph 
G, if uand vare in the same zone, or isomorphic to a graph Go if oth- 
erwise. In this paper, we continue the study of using balanced incom- 
plete block design for the construction of zone-balanced networks. We 
introduce some new methods to construct a wide class of such networks, 
which include some previous constructions as special cases. 


Il. INTRODUCTION 


The topology of a switching network can often be represented by a 
graph by taking switches as vertices and links as edges. By this repre- 
sentation, a multistage (switching) network is a graph the vertex-set 
of which can be naturally partitioned into s subsets Vj,...,V; and the 
edge-set into s — 1 subsets Fj,...,E;-1, for some number s, so that E; 
connects V; to V;41. (We do not allow multiple edges between two ver- 
tices.) Vertices in V, correspond to the input switches of the network 
and vertices in V, correspond to the output switches. Let ueV; and veV,. 
Then the channel graph G(u,v) is the union of all paths connecting u 
to v in the network. A multistage network is said to be regular if every 
vertex in V; has the same number of edges in E;_; and the same number 
of edges in £;. A regular multistage network is balanced if the channel 
graphs G(u,v) over all weV; and all veV, are isomorphic. 

For many real networks, the input and output switches can often be 
partitioned into subsets called zones such that switches in the same zone 
have a certain community of interest and are more likely to request a 
connection. (In this paper, we are concerned only with connection be- 
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tween an input switch and an output switch.) Such a network will be 
called a zone-balanced network if it is regular and there exists two graphs 
G, and G¢ so that G(u,v) is isomorphic to G, if u and v are in the same 
zone and G(u,v) is isomorphic to G2 if not. G; and G2 will be referred to 
as the intrazone and the interzone channel graphs, respectively. A 
zone-balanced network is said to be symmetrical if it is symmetrical with 
respect to the center stage or the two stages in the middle. 

A balanced incomplete block design (abbreviated as BIBD) with pa- 
rameters (v,0,r,k,\) is a family of blocks, with each block being a k-subset 
of the set {1,2,... ,v}, satisfying the following properties: 


(1) Every element in the set {1,2,. . . ,v} appears in exactly r blocks. 
(11) Every pair of elements in the set {1,2,. . . ,»} appears together in 
exactly \ blocks. 


BIBDs have long been a favorite subject for mathematicians and 
statisticians. The reader is referred to Ref. 1 for the existence and con- 
struction for many BIBDs. The use of BIBDs for constructing zone-bal- 
anced networks was first studied in Ref. 2. Some further constructions 
were given in Ref. 3. In this paper, we give some methods for such con- 
structions. The zone-balanced networks constructed previously, as well 
as in this paper, are all symmetrical. 


ll. SOME PRELIMINARY RESULTS 


A zone-balanced network is called canonical if each zone consists of 
a single input switch and a single output switch. Therefore, a CZBN 
(canonical zone-balanced network) can be viewed as a prototype for a 
full-fledged network with the same interzone and intrazone channel 
graphs. The mechanism for expanding a CZBN into a full-fledged network 
is the operation of “parallel expansion,” which was first introduced by 
Takagi* and by Timperi and Grillo.® For an s-stage network N, a (k,/) 
left (right) parallel expansion means taking k copies of N and identifying 
their subgraphs from stage j to stage s (from stage 1 to stage J). Figure 
1 gives some examples of parallel expansion. It is clear that parallel ex- 
pansion preserves the isomorphisms of the interzone and intrazone 
channel graphs. 

Next we introduce a method which we will use later to describe the 
connection between switches in two adjacent stages. To use this method, 
every switch in the two adjacent stages should be labeled by a subset of 
a given set. Then two switches in the adjacent stages should be connected 
if the label of one is contained in the label of the other. This type of 
connection will be called a labeled-subset connection. 
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Go 


G, A(2, 2) LEFT PARALLEL OF G, 
G3 G4 
A (2, 3) LEFT—PARALLEL OF G, A (2, 2) RIGHT—PARALLEL OF G3 





=. 


Fig. 1—Examples of parallel expansion. 


lll. A RECURSIVE CONSTRUCTION FOR CZBN 


A 2-stage CZBN with v zones is necessarily a v Xv complete bipartite 
graph; hence, its construction is trivial. We now give a construction for 
a 3-stage CZBN (noting that a 3-stage channel graph is uniquely deter- 
mined by its number of paths). 


Theorem 1: Suppose that a (v,b,r,k,\)-BIBD exists. Then we can con- 
struct a 3-stage CZBN with v zones which has r paths in its intrazone 
channel graph and X paths in its interzone channel graph. 


Proof: Take b switches of V2 and label each of them by a distinct block 
of the given BIBD. Take v switches of V;(V3) and label each of them by 
a distinct element of Z = {1,2,...,v}. Apply a labeled-subset connection 
between V2 and V;(V3). It is easy to verify that the resulting network 
is the one specified in Theorem 1. 


Example 1: Let the given BIBD have parameters (7,7,3,3,1) and have 
blocks (1,2,4), (2,3,5), (3,4,6), (4,5,7), (5,6,1), (6,7,2), and (7,1,3). Figure 
2 gives a 3-stage CZBN with 7 zones. 

We how give a recursive construction for a symmetrical s-stage CZBN 
for s = 4. 


Theorem 2: Suppose that an s-stage CZBN with k zones exists which 
has G, and G2 as its intrazone and interzone channel graphs. Fur- 
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Fig. 2—A 3-stage CZBN. 


thermore, suppose that a (v,b,r,k,\) BIBD exists. Then there exists an 
(s + 2)-stage CZBN with v zones which has the channel graphs as shown 
in Fig. 3. 


Proof: Let N be the given s-stage CZBN and assume that every input 
(output) switch of N is labeled by the zone it belongs to. Take b copies 
of N and let N; denote the ith copy. Replace the k zones in N; by the k 
elements in the ith block of the given BIBD. Take v switches of V;(V3) 
and label each switch by a distinct element of the set Z = {1,2,.. . ,v}. 
Apply a labeled-subset connection between V;(V3) and the input (out- 
put) switches of the b copies of N. It is easy to verify that the resulting 
network is indeed the one specified in Theorem 2. 


Corollary: Suppose that a (v,b,r,k,\) BIBD exists. Then we can construct 
a 4-stage CZBN with v zones such that its intrazone channel graph 
consists of r disjoint paths and its interzone channel graph consists of 
\ disjoint paths. 


INTRAZONE INTERZONE 





Fig. 3—Channel graphs for Theorem 2. 
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Example 2: Using the same BIBD as in Example 1, we obtain the 4-stage 
CZBN as shown in Fig. 4. 
IV. A CONSTRUCTION FOR CZBNS USING A BALANCED PARTITION 

OF BLOCKS 


Consider a (v,b,r,k,\) design, and let F; denote the subfamily of blocks 
containing element 1. A partition of F; is said to be balanced with pa- 
rameters (p,d) if the following conditions are satisfied: 

(i) F; is divided into p disjoint parts such that each part consists of 
r/p blocks. 
(it) Exactly d + 1 distinct elements appear in each part. 





INTRAZONE INTERZONE 


ee tien 


Fig. 4—A 4-stage CZBN. 
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(iii) For any element i’ ¥ i contained in part j, the number of blocks 
in part j containing 1’ is a constant. (From (i) and (ii), this constant must 
also be independent of /.) 

We say F; has a 2-step balanced partition with parameters 
(p1,d1,P2,d2) if F; has a balanced partition P),...,P,,, with parameters 
(p;,d,) and each P; has a balanced partition with parameters (po,d9). 
Similarly, we can define a t-step balanced partition of F; with parameters 
(P1,d1,P2,do,... ,Pt,d). 

Note that any t-step nested partition of a set N induces a partial or- 
dering which can be represented by a (¢ + 2)-level rooted tree. Suppose 
that N has n elements. Then the first level of the tree corresponds to the 
crudest partition, namely, a single node representing the set N itself, and 
the (t + 2)-level of the tree corresponds to the finest partition, namely, 
n nodes each representing a single element of N. The t intermediate 
levels of the tree correspond to the t partitions sequentially. By taking 
two copies of this tree and identifying their nodes at the (t + 2)-level, 
we obtain a (2t + 3)-stage symmetrical network. This mapping from a 
nested partition to a multistage network is critically used in the following 
theorem. 


Theorem 3: Consider a (v,b,r,k,X\) BIBD and let F; be the subfamily of 
blocks containing the element t. Suppose that for each F,, i = 1,2,. . . ,v, 
there exists a t-step balanced partition with the parameters 
(p1,d1,P2,d9,...,Dzd,). Then there exists a (2t + 3)-stage ((2t + 4)- 
stage) CZBN which has channel graphs as shown in Fig. 5: (q = 
r/Il}.,p;). (To obtain the channel graphs for the (2t + 4)-stage CZBN, 
replace each vertex in the center stage by the graph O-O.) 


INTRAZONE INTERZONE 





Fig. 5—Channel graphs for Theorem 3. 
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Proof (only for the case that n is odd): For each F;, i = 1,. .. ,v, construct 
a (2t + 3)-stage symmetrical network by using the given t-step balanced 
partition. The union of these v networks (they overlap at the center stage, 
since the F;’s overlap) yields a CZBN with channel graphs as specified 
in Fig. 5 with c = 1. Taking c copies of these networks and identifying 
their first stages and last stages, we obtain the desired CZBN. That the 
constructed network is “balanced” is a consequence of the partition being 
balanced. 


Corollary: Suppose a (v,b,r,k,1) BIBD exists and II}.,p; divides r. Then 
a CZBN with channel graphs as specified in Fig. 5 exists. 


Proof: With \ = 1, any partition which satisfies condition (1) of a bal- 
anced partition is a balanced partition. The same is true for a t-step 
partition. Therefore, when IIj.,p; divides r, then a t-step balanced 
partition with parameters (pj,... ,p,) always exists (the parameters d)s 
are determined by pys). 


Note that by applying Theorem 2 several times to the network con- 
structed in Theorem 3, we can obtain CZBNs with various types of 
channel graphs. In particular, we obtain the following: 


Theorem 4: Suppose that a sequence of BIBDs with parameters 
(v;,b;,r;,kj,A;), J = 1,2,. ..,m exists. Furthermore, suppose k; = vj+1 for 
j= 1,2,...,m—1, \m = 1, and IIj-p; divides r,,. Then there exists a (2t 
+ 2m + 1)-stage ((2t + 2m + 2)-stage) CZBN which has channel graphs 
as shown in Fig. 6: (q = rm/Tj21p1)- 


Proof: Use the (Um,bm,'m,km,;\m) BIBD to construct a (2t + 3)-stage 
CZBN from Theorem 3. Then apply Theorem 2 m — 1 times. 

Note that, if we take c copies of each k out of v combination, we obtain 
a (v,b,r,k,\) BIBD with b = c(#), r = c(Zz}) and \ = c(}_3). By setting k 
= v, it is clear that a (u,r,r,v,r) BIBD always exists. The zone-balanced 
networks constructed in Ref. 3 are thus seen to be special cases of the 
networks specified in Theorem 4 by setting A; =r; for j = 1,2,...,m — 
1. (The conditions that \ = 1 and II{.,p; divides r,, are not explicitly 
stated in Ref. 3, but a check with the author of Ref. 3 has verified their 
necessity.) 


V. A GENERALIZATION 


We can generalize the definition of zone-balanced network to partially 
zone-balanced network in which every pair of zones is classified into one 
of the k associate classes. The channel graphs of all intrazone pairs of 
the th associate are isomorphic to a graph G; regardless of which pair 
is chosen. The number of the ith associates of a given zone should be 
independent of which zone is chosen. Just as balanced incomplete block 
designs are a natural tool for the construction of zone-balanced networks, 
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INTRAZONE 





Fig. 6—Channel graphs for Theorem 4. 


we can use partially balanced incomplete block designs to construct 
partially zone-balanced networks, and results similar to those given in 
this paper can be obtained. However, the partially balanced incomplete 
block design is really too strong for our construction, since we do not 
require that for every pair of zones X and Y of the ith associate, the 
number of zones which are the jth associate of X and the kth associate 
of Y should be independent of X and Y. This suggests that some design 
weaker than the partially balanced incomplete block design should be 
studied for this purpose. 

After the completion of this paper, we learned that the author of 
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Ref. 3 had just revised her paper into a more complete and general ac- 
count.° However, the main difference between our construction and her 
construction remain as follows: (i) Her construction uses only one BIBD, 
while ours uses many BIBDs sequentially. (ii) The method of using bal- 
anced partition of blocks is unique in our construction. 
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The Reliability of 302A Numerics 
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To assess the long-term reliability of Western Electric 7-segment 
302A numerics, accelerated forward bias-aging (10 mA, 60° and 125°C) 
and thermal cycling (—40° to 125°C) experiments have been performed. 
In treating the bias-aging data, we strictly differentiate between LED 
chip and digit failure and show that the times to chip failure—defined 
here as the times required to reach a normalized efficiency of r = n/no 
= 0.5—are lognormally distributed. The analysis of the data was fa- 
cilitated by a novel computer-graphics routine which provides for each 
digit on test a bar-by-bar description of the time-evolution of r. The 
median life and standard deviation at 125°C and 10 mA for chips are 
2400 hours and ~0.4, respectively. Furthermore, we find that the failure 
distribution for digits can be obtained from the chip distribution by a 
simple probabilistic consideration. The good accord demonstrated 
between the experimental data and the theoretical curve derived from 
the diffusion theory of red GaP LED degradation indicates that the 
predominant mode of degradation in bias-aging of 302A devices is that 
of the LED. 

The thermal cycling response of 302A numerics encapsulated in 
Hysol 1700 epoxy is excellent. Similar to bias-aging, the chip failure 
distribution is lognormal, and chip and digit failures are interrelated 
by the probabilistic law. For a temperature excursion between —40° 
and 125°C, the median number of cycles to failure is 23,350 for chips 
and 300 for digits. The cause of failure is identified by electrical testing 
as open wire bonds. 

Finally, the acquired data permit the estimation of the mean times 
to failure (MTTF) and failure rates beyond infant mortality of 302A 
numerics in a specific application such as Transaction telephone 
sets under realistic operating conditions. The overall reliability of these 
devices is excellent, characterized by an MTTF of 10® hours and a 
maximum failure rate of less than 1 FIT for bias-aging over a 20-year 
service life. Failures due to broken wires are estimated to yield an MTTF 
of 10° hours and a failure rate of S4 FITs at 20 years of service. 
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I. INTRODUCTION 


The 302A and 302C red numeric displays designed* by Bell Labora- 
tories and manufactured by Western Electric—Reading consist of 8 red 
GaP chips arranged as 7 segments into a single digit and a right-hand 
decimal point. Each chip is mounted in a separate reflector and con- 
nected with either a common cathode (302A) or common anode (302C). 
In both cases, the device is encapsulated in Hysol 1700 epoxy.? 

In general, the effects of long-term thermomechanical, environmental, 
and electrical operating conditions on device performance are deter- 
mined concurrently with device development. The major objective of 
this work was to obtain reliability information on 302A numeric display 
devices by means of forward bias-aging and thermal cycling at acceler- 
ated rates, which enables us to predict their long-term behavior, beyond 
infant mortality, when used in typical Bell System applications. 

First, we provide an outline of the experimental procedures employed 
in the acquisition of failure data by high temperature forward bias-aging 
and wide temperature-range thermal cycling. Then, the handling, 
treatment, and graphic display of the copious amount of information 
generated by the bias-aging experiments are discussed. Second, the time 
evolution of the relative luminescent efficiency (r = nt/no) of 302A nu- 
merics during bias-aging is compared to the predictions of the diffusion 
theory of red GaP degradation.! Moreover, the failure distribution is 
established for the entire sample of chips. Next, we determine the failure 
distribution for thermal cycling and attempt to locate the thermome- 
chanical weak points of a bonded chip. Finally, we discuss the evaluation 
of the mean time to failure (MTTF) by a variety of criteria (i.e., alternative 
definitions of chip and digit failure). It is shown on the basis of proba- 
bilistic arguments that the failure distribution for digits can be calculated 
from the chip distribution. Likewise, in the case of thermal cycling, chip 
and digit distributions are convertible. The parametric values under 
normal operating conditions can be extrapolated from accelerated failure 
data by means of semi-empirical correlations. When they are combined 
with a device utilization model for a specific application, the MTTFs and 
failure rates induced by the forward bias and temperature cycling can 
be readily estimated. 


ll. EXPERIMENTAL 
2.1 Bias-aging 


Seventeen 302A numeric devices were selected for long-term elevated 
temperature forward bias-aging. Prior to aging (at t = 0), the light output 
of every segment was measured at 25°C by a standard technique.? 


* The device was designed by C. R. Paola. 
t A product of the Hysol Division, Dexter Corporation. 
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Briefly, the output was determined segment by segment over a wide 
range of pulsed test current inputs between 1 and 100 mA (including 2, 
5, 10, 20, and 50 mA). At each pulsed current, the duty cycle was adjusted 
to assure a constant 1 mA dc average current to minimize junction 
heating. The segment electroluminescence was detected by a PIN 10 
diode and its output after amplification was handled by an appropriate 
software program on an HP 9830A minicomputer to yield a table of in- 
formation stored on a computer file for all devices under testing.* The 
table contains the light output of each segment of a given device mea- 
sured over the entire range of measuring currents employed. The lowest 
and average initial light outputs were 0.021 and 0.035 millicandella/mA, 
respectively. 

Following initial testing, the 17 devices were split into two groups (9 
and 8 units in each group). One of the groups was aged at 60°, the other 
one at 125°C, in ovens continuously purged with filtered No. The devices 
were placed in trays, each holding three devices, connected to power 
supplies providing 10 mA dc forward bias. Periodically, all the devices 
were removed from the ovens to determine the effect of bias-aging on 
light output. Before performing the measurements by the above-de- 
scribed procedure, the devices were allowed to cool for two hours to 
25°C, 


2.2 Thermal cycling 


Ten 302A numerics were subjected to continuous temperature cycling 
without bias in a controlled-environment chamber. Each cycle consisted 
of cooling the devices in the chamber from room temperature (~25°C) 
to a cold dwelling point at —40°C, then reversing to a warm dwelling 
point at 125°C, and finally returning to ~25°C. At each temperature 
extreme, the dwelling time was about 20 minutes. The cooling and 
heating rates never exceeded 5°C/min. 

The devices were mounted in ceramic sockets attached to a combi- 
nation aluminum and phenolic test fixture through which electrical 
connections could be made for periodic checks. As a result of progressive 
thermal cycling, dark segments could be visually observed at the stan- 
dard forward current of 10 mA de. All the defective chips were faulty at 
both temperature extremes as well as at room temperature. For the 
duration of the first 100 cycles, checks were made at about every 5 cycles; 
thereafter, the test interval was increased to approximately 25 cycles. 


* The automated test facility was developed by J. W. Mann. 
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ill. DATA HANDLING PROCEDURE 


Whenever the degraded light output is measured, each device in the 
sample is characterized by 8 segments X 8 electrical values (including 
the decimal point and the device voltage at 10 mA). For a total sample 
of 17 devices, as many as 1088 numbers are acquired per test at all current 
levels. Obviously, computer storage, retrieval, and treatment of the data 
together with a suitable graphic display are required for detailed analysis. 
Therefore, a time-sharing program has been developed which plots the 
relative electroluminescent efficiency in a unique manner. First, the 
program LED/EFFCAL reads the permanently stored file of raw data 
generated by each light output measurement of any t. A short terminal 
dialog permits the user to name the type of device involved because, in 
addition to the 302A, the program is also applicable to other classes of 
numerics as well as to a group of 10 discrete LED chips. Moreover, one 
can specify the aging temperature and room temperature test current 
and one of the two plotting scales. Then, a file is written which includes 
the device identification number, aging temperature, total accumulated 
aging time, test current, and the number of devices in test under listed 
conditions. Moreover, an array is created from the raw data which, for 
every digit of each device,* contains, segment by segment, the relative 
efficiency as a function of the elapsed aging time. 

As shown in Fig. 1, in accord with the accepted convention, the seg- 
ments are designated by the alphabetic codes A, B ... G. For the A 
segment of the ith device, the relative efficiency at t is given by 

l 

UE. (¢) (1) 
n'a (0) 

where the conversion factor relating light output to electroluminescent 
efficiency 7 was cancelled. The array includes r‘, for each time the light 
output has been measured. In addition, it may be interesting to know 
the relative degradation of any segment in comparison with the mean 
value. For this reason, we devised the following statistical indicators: 

(1) Device or digit mean, 7; 





r, = 








7 ; 
oie 
F; = _ D) 
r 7 (2) 
(it) Grand mean, F 
n 7 n 
2 Pu ad Pj 
—  t=l1A=1 i 3 
ree qn n (3) 


where n is the number of devices tested under identical conditions. 


* Although in the case of a 302A numeric each device corresponds to a digit, allowance 
is made in the program for numerics which consist of as many as 4 digits per device. 
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Fig. 1—Top view of a 302A device showing letter designation of segments. 


(tit) Standard deviation—upper and lower confidence limits of the 
grand mean at ~68-percent confidence level, 7,, and Fg 
n 7 . m 1/2 
(2 2 Ga) 
i=1 A=1 


r,=fr —— (4a) 


and 


n ; 1/2 
(2 = Cia) 
Fp =F -— (4b) 
7n 
At the beginning of the file written for a given set of test conditions (i.e., 
aging temperature and current) 7, 7,, and 7 are listed in a time sequence. 
At the end of the file, we find 7; as a function of time for each digit. 

A batch program named LED/EFFPLT accepts these arrays to produce 
hard-copy plots of r versus time either using the rapid output STARE 3 
system or the FR80 microfilm plotter, especially suitable for white prints 
or viewgraphs. The two options for scales are log;or versus square-root 
of time and r versus log;otime. In Fig. 2, a computer-generated graphic 
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ig. 2—Computer-generated plot of logio normalized efficiency (r = n/no) versus time 1/2 
hee for a 302A device aged at 125 °C and 10 mA. The dashed, dash-dot, and dot-dot 
lines are the digit mean [eq. (2)], grand mean [eq. (3)], and standard deviation of grand 
mean [eq. (4)], respectively. 


output is presented for a 302A device aged at 125°C and 10 mA dc and 
measured at 10 mA in the logior versus Vt projection. Note that, for each 
segment of the digit, the symbol is its alphabetic designation in the order 
shown in Fig. 1. When overlap of the letters interferes with clarity, a 
slight horizontal displacement of the symbol along the time axis has been 
introduced into the plotting routine. In addition to the discrete r values 
of the individual bars, we also display the device mean, grand mean, and 
its lower and upper confidence limits by continuous dashed, dash-dot, 
and dot-dot lines, respectively. In Fig. 3, the data for the same device 
are presented using the r versus log t scale option. It can be readily seen 
that, at the chosen testing intervals, the former scale provides an evenly 
spread distribution of points at long aging times, while the latter (r vs. 
log t) achieves well-separated spacings at short times. Up to 3200 hours, 
the normalized efficiency of the 302A is apparently independent of time 
when bias-aged at 60°C. This is shown in Fig. 4 on a logyor versus Vt plot. 
Although the r scale is magnified here compared to Fig. 2, it does not 
appear to illustrate more than measurement fluctuations. Of course, this 
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Fig. 3—-Computer-generated plot of normalized efficiency (r = n/no) versus logo time 
for a 302A device aged at 125°C and 10 mA. The dashed, dash-dot, and dot-dot lines are 
the digit mean leq. (2)], grand mean [eq. (3)], and standard deviation of grand mean [eq. 
(4)], respectively. 


is consistent with the activation energy of red GaP LED degradation 
(0.7eV)® which leads to very little change in r at 60°C in the first.few 
thousand hours. 

These detailed computer-generated figures are very handy in a rapid 
assessment of numeric degradation and also as an intermediate step for 
further analysis. In particular, one can immediately see to what extent 
an individual chip departs from the digit and grand means. In addition, 
as shown in the next section, the MTTF of the failure distribution can 
be readily evaluated from the plots under a variety of failure defini- 
tions. 
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g. 4—Computer-generated plot of logig normalized efficiency (r = n/no) versus time !/2 
thoes for a 302A device aged at 60°C and 10 mA. The dashed, dash-dot, and dot-dot 
lines are the digit mean [eq. (2)], grand mean [eq. (3)], and standard deviation of grand 
mean [eq. (4)], respectively. 


IV. RESULTS AND DISCUSSION 
4.1 Bias aging 
To compare the bias-aging results on 302A numerics with existing 
information involving discrete red GaP LED chips, we have replotted 

from Fig. 3 the grand mean of the normalized efficiency and its standard 
- deviation as a function of logigt in Fig. 5. Superimposed on the same 
figure is the calculated course of degradation at 125°C ambient tem- 
perature and 10 mA stress current (132°C junction temperature). The 
theoretical curve is based on the diffusion theory of red GaP LED deg- 
radation.! 

Recently, the time evolution of nonradiative centers, thought to be 
responsible for the long-term degradation of red GaP LEDs, has been 
modeled. It has been postulated that degradation is due to the diffusion 
and accumulation of an undesirable impurity or point defect through 
the depletion layer as the p-n junction potential, which retards defect 
motion, is reduced by the forward voltage. An explicit analytical ex- 
pression between r and ¢t was derived which provided a good fit to the 
degradation data for discrete diodes obtained at various junction tem- 
peratures and stress currents. The equation for r is of the form! 


r(t) = Z 


Vitro t volley’ ot 
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Fig. 5—Normalized efficiency versus logi9 time for 302A numerics aged at 125°C and 
10 mA. The full line is calculated from the diffusion theory of red LED degradation [eq. 


(5)]. The data points are the grand mean and its standard deviation for all segments of 
the sample (see Fig. 3). 


where ¢ is an infinite series, y is a constant which is proportional (among 
other quantities) to the initial undesirable impurity concentration and 
electron lifetime of a diode lot, and ro reflects the rapid drop in 7 at rel- 
atively short aging times. The quantity @ is related to the diffusivity (Do), 
activation energy (AH,), and stress current (J¢tres) according to 


1 
0 = al stresse S44 a/* y (5b) 


where a is a known constant of proportionality. 

To calculate the degradation curve in Fig. 5 from eqs. (5), we made use 
of the parameters listed in Ref. 1. However, on account of the lot-de- 
pendent properties of y, asmall upward adjustment in its value was re- 
quired to achieve optimum description of the data. Without this change, 
the computed r values would be approximately 10 percent above the 
plotted ones at times in excess of 1000 hours. 

The good agreement seen between the experimental normalized ef- 
ficiencies for 302A numerics and the theoretical curve indicates that 
these red devices do not exhibit failure modes in addition to LED deg- 
radation. This finding is corroborated by the 125°C storage aging of 
numerics which has not discolored the Hysol 1700 encapsulant. Thus, 
it appears that the optical coupling efficiency of 302A numerics is in- 
variant during bias aging. 

Since the normalized efficiency of individual segments visibly deviates 
from the grand mean (Fig. 2), the failure of 302A numerics must be 
distributed by some statistical law. Detailed digit-by-digit failure plots 
similar to Fig. 2 permit the determination of the failure distribution as 
a consequence of bias aging. The following possible failure criteria are 
worthy of exposition in some detail: 
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(t) Chip failure: Whenever any one of the 7n chips in the samples 
reaches r, = 0.5, that time is denoted as the time to chip failure 
(TTCF). 

(it) Digit failure: The time for the first chip in each device to attain 
r', = 0.5 is designated as the time to digit failure (TTDF). 

(iit) Digit mean failure: The time for any digit mean F; [eq. (2)] to 
equal 0.5 is named the time to digit mean failure (TTDMF); 

(iv) Grand mean failure: When the grand mean normalized efficiency 
leq. (3)] F = 0.5, we speak of time to grand mean failure (TTGMF). 
Each one of these quantities possesses a mean except TTGMF. 

The total number of chips in our sample of 8 aged at 125°C is 8 X 7 = 
56. In Fig. 6, we present on lognormal graph paper the time to chip failure 
as a function of cumulative failure percent.?+ The TTCF values were 
taken from computer-generated plots for each 302A digit, identical in 
form with Fig. 2, and then rank-ordered to provide the cumulative fail- 
ure. It should be noted that TTCFs above ~3400 hours were obtained by 
linear extrapolation on the Vt plots.® Hence, the longer the time to chip 
failure is, the less accurate the TTCF value becomes. Fortunately, this 
is probably not important except in the case of the last two points. 

The linearity of the TTCF plot in Fig. 6 indicates that the failure dis- 
tribution for 302A numeric chips is lognormal, as is the case for 1A 
opto-isolators,® which utilize GaP LEDs and also for numerous semi- 
conductor devices.” Least-square analysis provides the following log- 
normal parameters for the chip failure distribution.* 

Ma = IN tma = 7.783 
Og = 0.87, 
where the median life, tm, is 2400 hours. It can be seen that the distri- 
bution is very tight, as o, is quite small. 
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Fig. 6—Time to failure versus cumulative failure percent for 302A numerics aged at 
125°C and 10 mA. The symbols O, OG, and A correspond to failure criteria (7), (it), and (vit), 


respectively (see text). The solid line is a least-square line for chip failure, while the dashed 
line is calculated from eq. (9) for digit failures. 
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The time to digit mean failure as a function of cumulative failure 
percent is also shown in Fig. 6. Each one of the computer-generated plots 
for each device, similar to Fig. 2, yields one value of TTDMF. Because each 
time value is the result of averaging seven normalized efficiencies, the 
standard variation is smaller than for TTCF. However, the median life 
tme = 2200 hours, which is almost the same as for chip failure. 

Perhaps the most important distribution for numerics is the time to 
digit failure, since it is reasonable to assume that if any one chip in a 7-bar 
numeric loses half its efficiency, the displayed numerals may not be 
correctly discriminated by eye. The TTDF distribution is also shown in 
Fig. 6. Its construction from the computer-generated digit-by-digit 
graphic outputs is self-evident. We can see that in comparison with a tg 
of 2400 hours for chips, the median life for digits, t,,.,, is reduced to 1600 
hours with an accompanying decrease in o. In Table I, we summarize the 
median lives obtained by a variety of methods. It can be seen that all the 
values but the median of TTDF are closely spaced. 

However, the chip and digit failure distributions are related by the 
laws of probability. The reliability function for chip failure R, is the 
complement of the plotted cumulative failure Q,, hence at any time t 


Ra(t) = 1 — Qa(t). (6) 
R, expresses the probability that the chip will survive to ¢. If the chip 
failures in the device are independent, then the reliability of the digit, 
Ry, assuming device failure if any one of the segments fails (ry < 0.5), 
is given by® 
Rp (t) = R(t). (7) 
Obviously, the TTDF definition is consistent with eq. (7). The cumulative 
failure function for device failure, Q,, is of the form 


Qo(t) = 1— R(t) = 1—-— Rie). (8) 
Finally, a combination of eqs. (6) and (8) yields 
Qp(t) = 1 - (1 -— Qa(t))%. (9) 


The cumulative device failure function Q, for 302A numerics can be 


Table | — Median lives for a 302A red numeric 
at 125°C and 10 mA 
Method tm (hours) 
Chip failure 2400 
Digit mean failure 2200 
Digit failure 1600 
Grand mean failure* 2300 
Diffusion theory (r = 0.5)* 2200 


* Not a median but a single value. 
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readily obtained from the full line (lognormal distribution) in Fig. 6 and 
eq. (9). The dashed line in Fig. 6 is the calculated Q,. Considering the 
small sample size in terms of digits, the calculated line is a surprisingly 
good representation of the cumulative failure data for digits. The median 
life appropriate for Q, is 1500 hours, which should be compared with the 
empirical result of 1600 hours. Therefore, it matters very little how the 
median lives are computed (chip versus device), as long as the results 
are correctly interpreted. Furthermore, although the median lives ob- 
tained by various criteria nearly coalesce, this may not be true of the 
MTTFs and failure rates as those quantities also involve the standard 
deviations which, according to the slopes over the data in Fig. 6, are quite 
variable. 


4.2 Thermal cycling 


In thermal cycling, chip failure is sudden and manifests itself as a dark 
segment on testing. In analogy with the definition invoked in the section 
on bias-aging, digit failure occurs at the number of thermal cycles at 
which the very first segment fails to light up. In Fig. 7, we present a 
lognormal projection of the number of cycles to failure versus cumulative 
failure percent, both in terms of chip as well as device failure, for 302A 
red numerics encapsulated in Hysol 1700 epoxy. 

Again, as in the case of bias-aging, the failure distribution follows the 
lognormal pattern, as it plots as a straight line on the lognormal graph- 
paper. Least-square analysis yields the following parameters for chip 
failure: 


Hte = In tEmte = 10 and Otc = Beals 


where tmec = 23,350 is the median number of cycles to failure (MCTF). 
The large standard deviation corresponds to a widely spread failure 
distribution. This is also clear from the steepness of the data and the 
least-square (solid) line in Fig. 7. 

The probabilistic equation derived in the previous section on bias- 
aging to calculate the digit failure distribution from information on chips 
is also valid for thermal cycling. Applying eq. (9) to the lognormal line 
for chips, we can calculate the number of thermal cycles to failure as a 

function of cumulative failure function for digits. The dashed line in Fig. 
7 represents the digit cumulative failure function. The digit MCTF is 300. 
It is apparent that there is excellent agreement between the theoretical 
line and the data points for digit failure. 

The effect of increasing the number of digits in the same package is 
also shown in Fig. 7. The dash-dot line is the cumulative failure function 
for a hypothetical 302A-like device consisting of four digits (28 bars). 
It is obvious that, with increasing complexity, there is a drastic drop in 
the MCTF with a somewhat compensating drop in standard deviation. 
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Fig. 7—Number of thermal cycles to failure versus cumulative failure percent for 302A 
numerics, encapsulated in Hysol 1700 epoxy, thermally cycled between —40° and 125°C. 
The solid line is a least-square line for chip failure, while the dashed line is calculated from 
eq. (9) for digit failure. 


To locate the source of thermal cycle failure, a number of devices with 
numerous chip failures were lapped and polished until the gold leads to 
the diodes were exposed. A detailed view of the lapped area of a 302A 
numeric is illustrated in Fig. 8. The polishing of the lens continued to 
the point at which the anode connecting Au wire was broken between 
the lead frame and the die-bonding pad. Due to the ambiguity of viewing 
the location of failure, simple electrical checks were performed. Using 
cathode pins as the common terminal, a fine needle point probe with +2V 
bias was successively applied to the ends of the severed Au wire and to 
the bonding pads by piercing the plastic. As a result of such tests, we 
determined that, after thermal cycling, open circuits develop which are 
evenly distributed between breaks in the neck of the ball bond and the 
heel of the wedge bond. 
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Fig. 8—Cutaway view of a 302A numeric showing the exposed bonds. 


4.3 Reliability calculations—device utilization model 


It should be very strongly emphasized that the adequacy of a given 
median life and o combination based on exhaustive tests can only be 
judged if the device designer possesses information from systems engi- 
neering on its anticipated application. Hence, a device utilization model 
for the 302A numeric is essential. 

One use of the 302A display is in Transaction telephone sets. 
If we envision very frequent operation, such as in airline terminals, fresh 
information may appear on the displays as often as every minute for a 
30-s duration. The devices operate at 10 mA dc, which leads to a 7°C 
junction heating. Thus, if the ambient temperature is 27°C, the devices 
are exposed to 34°C. During the off-state, the time (30s) is too short to 
reach thermal equilibrium with the ambient. Therefore, we assume a 
temperature fluctuation or cycling excursion of no more than AT’ = 
2°C, 

The model in combination with the reliability data derived herein 
permit the estimation of the failure rates and MTTFs during device op- 
eration under forward bias, and also on account of temperature varia- 
tions. We shall give the reliability-associated properties for both LED 
chips and digits at 20 years of service. 

(i) Bias Aging: The accepted activation energy for red LED degradation 
is 0.7 eV.° With this value, the Arrhenius-law multiplier between the 
aging temperature (132°C) and use temperature (34°C) becomes 604. 
Applying the multiplier to the chip and digit median lives, tmq = 2400 


2996 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


and t,,, = 1600 hours, respectively, we obtain at the use tempera- 
ture* 


tma (34°C) = 1.45 X 10° hours, tm, = 9.66 X 10° hours. 


The standard deviations are usually taken as temperature-independent 
constants, and their values are 


Gq = 0.387 and a, = 0.27. 


It should be noted that the device failure distribution (Q,) is not strictly 
lognormal and oy results from the linearization of the dashed line in Fig. 
6. 

The above parameters yield the following MTTFst and maximum 
failure rates for a 20-year! service life beyond infant mortality:34 


MTTF, = 1.57 X 108 hours and MTTF, = 1.01 X 10° hours 
and 
Na K 1 FIT and Ay <K 1 FIT. 


These failure rates for bias aging are outstandingly low. This is a con- 
sequence of the fact that both the chip and device distributions, as shown 
in Fig. 6, are very tight, corresponding to a small o. If it is assumed that 
the investigated lot was atypical and occasionally o, and op may become 
as large as 1 and 0.9, respectively, then we obtain for the As 


Aq = 90 FITs and A, = 150 FITs, 


indicating, as expected, that the failure rate for chips is less than for 
digits. 

(it) Thermal Cycling: To relate the median number of cycles to failure, 
MCTF, obtained by long-term wide AT’ excursion experiments to the 
small AT excursions encountered in use, an acceleration factor is re- 
quired. We have estimated this factor on the basis of previous work on 
gold beam fatigue. Dais and Howland? have shown, for rubber encap- 
sulated devices tested in the plastic deformation domain, that the 
magnitude of the temperature excursion between AT = 400° and 45°C 
is a monotonically decreasing function of the median number of cycles 
to failure. We can characterize the dependence of AT on cycles by a 
power law with an exponent increasing from ~—0.5 (Coffin’s Law!°) to 
a limiting value of ~—0.1. In addition, theoretical analysis of typical 
device structures indicates that the magnitude of the maximum elastic 
AT span for rubber encapsulant is about twice that for epoxy.!! Thus, 


* The subscriptsa and b denote chip and device reliability properties, in accordance with 
the definitions in Section 4.1. 

+ MTTF = t,e77/2. 

! The effective service life is only 10 years due to the duty factor of 0.5. 
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it seems reasonable to take AT = 20°C as the transition temperature 
between elastic and plastic deformation for the epoxy encapsulant. 
Consequently, by extrapolating the results of Dais and Howland? be- 
tween AT = 165° and 20°C, we obtain ~3 X 10’ as the approximate ac- 
celeration factor appropriate for 302A numerics. Since a temperature 
cycling range of only 2°C is anticipated in use, the deformation always 
remains elastic. Hence, the acceleration factor and failure rates given 
here should be considered as lower and upper bounds, respectively. 

A combination of the acceleration factor and the experimental values 
MCTFexpa (23350) and MCTF exp» (300 cycles) for chip (a) and digit (b) 
failure, respectively, yields 


MCTFysea = 7 X 101! cycles and MCTF,seh = 9 X 10° cycles 
and oicq = 3.27 and oie, = 2, 


where o¢cp is from the hypothetical linearization of the dashed line in 
Fig. 7. Since in operation there are 60 cycles/hour, the median lives be- 
come 


tmtca = 1.2 X 10!° hours and tpiey = 1.5 X 108 hours. 


The above parameters yield the following MTTFs and failure rates, 
Nec, at 20 years of service life:3-4 


MTTFtcq = 2.5 X 1012 hours and MTTF;,5 = 1.1 X 10% hours 
Ntca = 2 FITS and );-p = 4 FITS. 


In conclusion, we find that the 302A red numeric performs very reli- 
ably in specific applications such as the Transaction telephone. 
We have shown that the long-term failure rates associated with LED 
degradation and junction heating induced thermal cycling are very low, 
namely, no more than 4 FITs over 20 years of continuous service. 
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Intelligible Crosstalk Performance 
of Voice-Frequency Customer Loops 


By K. |. PARK 
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A methodology for evaluating the intelligible crosstalk performance 
of voice-frequency customer loops is developed in this paper. Using this 
methodology, intelligible crosstalk probabilities are calculated for a 
representative sample of loops in the plant. The effect of gain on loop 
crosstalk performance is then evaluated for a particular example of 
gain application where length-dependent gain ranging approximately 
from —1 to 9 dB is added. Two possible locations of gain application 
are evaluated: the central office and the telephone set. Presently, no 
crosstalk performance objectives exist for loops. For planning purposes, 
however, an intelligible crosstalk probability of 0.1 percent has been 
used in the past as a limit for satisfactory performance. In comparison 
with this limit, the crosstalk performance of the present loop plant 
(loops without gain) is satisfactory. For the particular example of gain 
application considered in this paper, gain applied at the central office 
has only a small effect on loop crosstalk performance. However, gain 
applied at the telephone set degrades loop crosstalk performance sig- 
nificantly, increasing the crosstalk probability above the 0.1 percent 
level on about 15 percent of the sample loops evaluated. 


I. INTRODUCTION 


A telephone user occasionally receives an extraneous speech signal 
as a result of interference between communications circuits, which is 
referred to as crosstalk. Crosstalk not only produces annoyance to the 
affected customer but also constitutes loss of another customer’s privacy 
when it is intelligible, and is an important concern in transmission sys- 
tems design and planning. For example, if, with the advancement of loop 
electronics, gain devices are applied on loops to enhance the speech signal 
level, the maximum allowable amount of gain and the location of its 
application may be restricted by the resulting crosstalk performance 
degradation. In this paper, a methodology is developed for evaluating 
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the intelligible crosstalk performance of voice-frequency customer loops 
that can be used in loop transmission systems design and planning. In 
particular, the methodology can be used in (z) establishing loop crosstalk 
performance objectives, (ii) allocating the objectives to components of 
the loop plant, such as cable facilities, central office switches, and cus- 
tomer-premises wiring, and (iii) evaluating effects of new technology 
and new loop design rules on crosstalk performance. 

Intelligible crosstalk performance is measured by the crosstalk 
probability, which is defined as the probability that a customer will hear 
one or more intelligible crosstalk words during a call. The crosstalk 
probability on customer loops depends on the probability distributions 
of such random variables as call holding time, quiet interval between 
calls, disturbing talker volume, crosstalk path loss, circuit noise, and 
disturbed-listener hearing acuity. These underlying probability distri- 
butions in turn depend on telephone connection configurations and 
crosstalk coupling loss characteristics of the multipair cables used for 
loops. 

The loop crosstalk evaluation methodology developed in this paper 
can be divided into three basic parts as shown by the block diagram of 
Fig. 1: a cable crosstalk coupling model, a telephone connection model, 
and a crosstalk probability model. The cable crosstalk coupling model 
provides equations for calculating near-end and far-end crosstalk cou- 
pling losses between customer loop wire-pairs in multipair cables. The 
model contains adjustable parameters, which are estimated by fitting 
the model to measured crosstalk coupling loss data. The telephone 
connection model describes typical intraoffice (loop-to-loop) telephone 
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Fig. 1—Modeling of loop crosstalk. 
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connections as the disturbed connections and identifies potential 
crosstalk exposures to other intraoffice or toll connections. For the 
purposes of this study, loop characteristics, such as length, loading, and 
loss, are described, based on either theoretical loop design rules! or the 
information obtained from the loops sampled in the 1964 Loop Survey.? 
The crosstalk probability model, the last of the three parts shown in Fig. 
1, combines the information provided by the preceding two models with 
data on traffic activity on loops, talker volume, circuit noise, and listener 
hearing acuity, and determines, by a Monte Carlo simulation, the 
crosstalk probabilities for loops. 

The methodology developed in this paper provides the following 
features: 

(t) By virtue of the analytical cable crosstalk coupling model in- 
troduced here, the loop crosstalk performance can be evaluated as a 
function of loop length, rather than only for the fixed length for which 
measurements are available. 

(tt) The distribution of crosstalk probability is obtained for all the 
loops sharing a cable of arbitrary length, or for loops of different lengths 
sampled from the loop plant. 

(tit) The telephone connection model developed here is general 
enough to include a number of different crosstalk exposures, such as 
near-end and far-end crosstalk occurring in the disturbed customer’s 
loop and near-end and far-end crosstalk occurring in the loop of the 
customer at the other end of the disturbed connection. 

(tv) The effect of gain on crosstalk is evaluated for gain applied at 
the telephone set as well as for gain applied at the central office. 

(v) For disturbing talkers’ speech volumes, the latest speech volume 
data obtained in 1976? is used. 

A number of studies on the subject of crosstalk in general were made 
previously at Bell Laboratories, including those by T. C. Spang, B. E. 
Davis, M. G. Mugglin, D. H. Morgen,‘ and P. M. Lapsa.® Lapsa, in par- 
ticular, considered a loop crosstalk problem similar to one specific case 
of the present study—the case of the effect of gain applied at the central 
office. Focusing primarily on long rural loops with gain applied at the 
central office and considering near-end crosstalk (NEXT) at the central 
office as the major crosstalk exposure, he assumed an “electrically long” 
loop—sufficiently long to render the NEXT coupling loss independent 
of length—and used measured NEXT coupling loss data. For disturbing 
talkers’ speech volumes, Lapsa used McAdoo’s speech volume data ob- 
tained in 1960.6 He concluded that gain of 6 dB or less applied at the 
central office would be acceptable. In comparison with this, the results 
of the present paper on the effect of the central office gain are more 
optimistic because of, among other things, the use of more recent cou- 
pling loss and speech volume data in the present study, as discussed in 
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Section 3.1. In the case of the effect of gain applied at the telephone set, 
no similar study was made previously that can be compared with the 
present study. 

Section II describes the three basic models constituting the method- 
ology shown in Fig. 1 and determines the probability distributions of the 
underlying random variables. Section III evaluates the loop crosstalk 
probability in detail. Section IV is the summary of the loop crosstalk 
probability evaluation results. 


ll. METHODOLOGY 
2.1 Twisted multipair cable crosstalk coupling model 


Crosstalk performance of a customer loop depends on, among other 
things, the electromagnetic coupling characteristics between the loop 
and the other loops sharing the same twisted multipair cable. An ana- 
lytical model was developed to provide equations for the near-end and 
far-end crosstalk coupling losses between wire-pairs in a cable as a 
function of frequency, cable length, and terminating impedances. Such 
a model is necessary because coupling loss measurements are available 
only for certain frequencies, cable lengths, and terminating conditions. 
_ A detailed derivation of the model is described in an unpublished work 
by the author.’ In this section, this cable crosstalk coupling model is 
described in general terms. 

A twisted multipair cable consists of a number of twisted wire-pairs 
stranded together. Each wire-pair is used as a loop, which is permanently 
assigned to a customer as the transmission path between his telephone 
set and the serving central office. Although the wire-pairs in a cable are 
isolated from one another, a certain amount of electromagnetic coupling 
between simultaneously active pairs is unavoidable. 

As illustrated in Fig. 2, crosstalk is referred to as near-end crosstalk 
(NEXT) when the signal source on the disturbing pair and the point of 
crosstalk reception on the disturbed pair are at the same end of the cable, 
and far-end crosstalk (FEXT) when they are at the opposite ends of the 
cable. The difference in decibels between the disturbing power and the 
received crosstalk power is referred to as coupling loss. Referring to Fig. 
2, NEXT and FEXT coupling losses from pair j into pair 1, denoted by 
NEXT;; and FEXT;;, are defined by the following equations: 


NEXT;; = V ;(disturbing,near-end) = V iaistaibed uearend) (1) 
FEXT;; = Vj(disturbing,far-end) ~ Vi(disturbed,far-end): (2) 


where V; (disturbing,near-end) and Vj (disturbing,far-end) are the disturbing signal 
powers at the source and the far end on the disturbing pair, pair J, ex- 
pressed in decibels relative to a reference power; and Vj (disturbed,near-end) 
and Vi(disturbed,far-end) are the crosstalk signal powers at the near end and 


3004 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


Vi 
~ J (DISTURBING, NEAR — END) CABLE 


«* TERMINATION 





(a) 


Ni 
i (DISTURBED, NEAR — END) 


_™ j (DISTURBING, FAR — END) 
/ 


CABLE 


SIGNAL 
SOURCE 





a (b) 
‘Vj (DISTURBED, FAR — END) 

Fig. 2—Definition of NEXT and FEXT: NEXT or FEXT coupling loss from pair j into 
pairi is the decibel difference between the disturbing volume V; and the crosstalk volume 
V; measured at the points shown by X. (a) Near-end crpast alle (NEXT). (b) Far-end 
crosstalk (FEXT). 


the far end on the disturbed pair, pair i, expressed in decibels relative 
to a reference power. 

Crosstalk performance of a multipair cable can be characterized by 
determining NEXT and FEXT coupling losses defined by eqs. (1) and (2) 
for all possible combinations among its wire-pairs. In this paper, the 
coupling losses are determined analytically by the cable crosstalk cou- 
pling model mentioned earlier.’? The model provides equations for NEXT 
and FEXT coupling losses as a function of frequency, cable length, and 
the terminating impedances of the disturbing and disturbed pairs. It 
contains certain adjustable parameters which are dependent on the 
proximity between pairs in a cable and which can be determined by 
fitting the model to measured crosstalk coupling loss data. ' 

The model was fitted to recent crosstalk data measured at Bell Lab- 
oratories, Atlanta, on a typical cable used in the loop plant. The data 
consisted of the NEXT and FEXT coupling losses of 300 pair-to-pair 
combinations (all possible combinations) in a 25-pair, 26-gauge, non- 
loaded polyethylene insulated cable (PIC), measured at eight different 
frequencies (2, 3, 5, 10, 28, 56, 76, and 150 kHz). The length of the mea- 
sured cable was 3 kft, and all pairs were terminated in pure resistive, 600 
ohms at both ends. For each of the 300 pair-to-pair combinations, the 
model parameters were determined by the least-squares method. Two 
examples of the results of fitting the model to the data are shown in Fig. 
3, where the abscissa is frequency and the ordinate NEXT coupling loss 
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Fig. 3—Two examples of the results of fitting the analytical cable crosstalk coupling 
model to NEXT coupling loss data measured on a 3-kft, 25-pair, 26-gauge, nonloaded PIC 
cable with pure resistive 600-Q terminations. 


in decibels. The As and Os show the measurements* and the solid 
curves, the theoretical coupling losses fitted by the model. The rms errors 
between the measurements and the fitted values for these two particular 
pair combinations are 0.8 and 1.2 dB, respectively. 

Figure 4 presents the cumulative distribution functions (CDFs) of the 
voice frequency (1 kHz) NEXT and FEXT coupling losses of all the 300 
pair-to-pair combinations of the 25-pair cable, calculated by the model 
for an arbitrarily chosen reference cable length of 1 kft. The far-end and 
near-end terminating impedances were fixed at (900-800) ohms and 
(600 + j200) ohms, respectively, the average terminating impedances 
of the loops at the central office and at the telephone set, estimated from 
the 1964 Loop Survey.? 

Coupling losses vary with cable length. Figures 5 and 6 show length 
translation factors normalized to 1 kft, as calculated by the model for 
the voice-frequency NEXT and FEXT coupling losses. Figure 5 shows that, 
beyond a certain length, in this case about 30 kft, the translation factor 
for NEXT no longer changes with length. A cable longer than this is re- 
ferred to as electrically long. From Fig. 5, the NEXT loss at such an 
electrically long length is about 7 dB smaller than the NEXT loss at 1 kft. 
Figure 6 shows that FEXT loss keeps decreasing with cable length without 
saturation. 


* As and Os in Fig. 3 identify the pair combinations with the 1-percent worst and the 
median NEXT coupling loss among the 300 measurements at 2 kHz, respectively. 
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Fig. 4—The cumulative distribution functions (CDFs) of the voice-frequency (1 kHz) 
NEXT and FEXT coupling losses calculated by the theoretical model with cable length fixed 
at 1 kft for the 25-pair, 26-gauge, nonloaded PIC cable. The coupling losses at other cable 
ai a etre by using the length translation factors calculated by the model, shown 
in Figs. 5 and 6. 


The NEXT and FEXT coupling losses at lengths other than 1 kft can 
be obtained by subtracting the corresponding length translation factors 
determined from Figs. 5 and 6 from the 1-kft coupling losses shown in 
Fig. 4. For example, the 1-percent worst NEXT coupling loss at | kft is, 
from Fig. 4, about 91 dB and the 1-percent worst NEXT coupling loss at 
an electrically long length, say 50 kft, is obtained to be 84 dB by sub- 
tracting the length translation factor of about 7 dB, determined from 
Fig. 5, from the 1-kft loss, 91 dB. 

The data used to determine the model parameters were measured on 
an unspliced, laboratory cable. In the plant, several reels of cable may 
be spliced to form a single long cable. PIC cables are straight spliced; that 
is, pair identifications on the first reel are maintained over the subse- 
quent reels. This type of splicing has theoretically no effect on the model 
prediction. For randomly spliced cables, such as pulp cables, the splicing 
may have some effect because pair locations change over the subsequent 
reels. At present, there are no appropriate field measurements that can 
be used to examine the effect of random splicing on crosstalk. However, 
other things being equal, random splicing should render the crosstalk 
prediction by the model somewhat conservative (pessimistic) because, 
with such splicing, the worst crosstalk pair combination of the first reel 
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Fig. 5—Length translation factor normalized to 1 kft, calculated by the theoretical model 
for the voice-frequency NEXT. 


would not necessarily be the worst combination in the subsequent 
reels. 

The coupling losses discussed above represent the coupling losses of 
nonloaded cables, which make up the majority* of the loops in the plant. 
At the present time, there is no theoretical means of predicting the effect 
of loading on crosstalk coupling losses. Based on Bell Laboratories 
coupling loss data measured on loaded cables, it is assumed that, other 
conditions being equal, loaded cables, which make up a relatively small 
fraction of the loop plant, have approximately 3 dB smaller NEXT losses 
than nonloaded cables at 1 kHz. For FEXT, the same FEXT coupling 
losses are used for both nonloaded and loaded cables. 


2.2 Telephone connection model 


A model of telephone connections is described in this section to 
identify potential crosstalk exposures and determine the distributions 
of received crosstalk volume and other random variables affecting 
crosstalk performance. On connections involving trunks as well as loops, 
the crosstalk on trunks is dominant. To evaluate the loop crosstalk taken 
alone, intraoffice connections, consisting of two loops connected at the 
central office, are considered the disturbed connections. Intraoffice 


* The 1964 Loop Survey (Ref. 2) shows that 84 percent of the loops sampled in the survey 
are nonloaded loops. 
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Fig. 6—Length translation factor normalized to 1 kft, calculated by the theoretical model 
for the voice-frequency FEXT. 


connections have relatively low circuit noise, providing low masking on 
crosstalk intelligibility, and thus are in general most susceptible to in- 
telligible crosstalk. As the disturbing connections, both toll and in- 
traoffice connections are considered. 

As shown in Fig. 7, a consumer at one end of an intraoffice connection 
is subject to the following four potential crosstalk exposures: NEXT and 
FEXT occurring in his own loop, and NEXT and FEXT occurring in the 
loop of the customer at the other end of the disturbed connection. 
Comparing Fig. 2 with Fig. 7, the cable end where the coupling losses are 
defined corresponds to the telephone set line-terminals for the first two 
crosstalk exposures and the central office loop terminations for the latter 
two exposures. For convenience, therefore, the first two exposures will 
be referred to in this paper as “line terminal NEXT” (LTNEXT) and “line 
terminal FEXT” (LTFEXT) and the latter two as “‘central office NEXT” 
(CONEXT) and “central office FEXT” (COFEXT). Of these four crosstalk 
exposures, LTNEXT is, in general, most important because, with this 
exposure, the disturbing talker’s volume is attenuated only by the cou- 
pling loss between the two loops involved, and there are no additional 
losses in the crosstalk path. In the other three exposures (LTFEXT, CO- 
NEXT, and COFEXT), the disturbing talker’s volume is attenuated by 
loop losses in addition to coupling losses, and thus the crosstalk from 
such an exposure is less likely to be intelligible than LTNEXT. On the 
other hand, if gain is applied on loops in the future, the relative impor- 
tance of the four crosstalk exposures may change depending on the lo- 
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Fig. 7—Four types of potential crosstalk exposures of the intraoffice (loop-to-loop) 
connection: line-terminal NEXT (LTNEXT), line-terminal FEXT (LTFEXT), central office 
NEXT (CONEXT), and central office FEXT (COFEXT). 


cation of the gain application. The effect of gain on crosstalk will be 
discussed in Section 2.4. 

The crosstalk level in VU (volume units) received at the line-terminals 
of a disturbed customer’s telephone set is the speech level (in VU) at the 
disturbing talker’s telephone set minus the loss (in decibels) of the 
crosstalk path from the disturbing talker to the disturbed listener. The 
loss of the crosstalk path includes the loss from the disturbing talker to 
the point of crosstalk coupling, the coupling loss and the loss from the 
point of crosstalk coupling to the disturbed customer’s telephone set. 
The crosstalk level on pair i received from pair j for the four crosstalk 
exposures, denoted by Vurnext,, ViTrext;, VCoNExT; and Vcorext; 
are given by the following equations: 


V wnext; = Vit; — LTNEXT;; (3) 
Viurrext, = Vco; — Le — LTFEXT;; (4) 
Vconext; = Vco; — CONEXT;; — Le (5) 

Vcorext, = Vir; — L1 — COFEXT; — Le, (6) 


where Vi7, and Vco; denote the disturbing talker volume at the line 
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terminals and the central office, LTNEXTjj, LTFEXT;j, CONEXT;; and 
COFEXT;; denote NEXT and FEXT coupling losses at the line terminals 
and the central office [as defined by eqs. (1) and (2)], and L; and Le 
denote the losses of the loops in the two cables involved in the loop- 
to-loop disturbed connection. Since talker volume may be assumed to 
have a same distribution on all pairs in a given cable, the subscript 7 may 
be dropped from the disturbing talker volume in the above equations. 

The electrical talker volume as measured at the serving central office 
was determined by a recent survey undertaken by Bell Laboratories to 
be nearly normally distributed with a mean of —22.2 VU (volume unit) 
and a standard deviation of 4.6 dB for intraoffice calls and a mean of 
—21.6 VU and a standard deviation of 4.5 dB for toll calls.? These latest 
speech volume data are used in this paper. These data show that there 
is very little difference in talker volume statistics between intraoffice 
and toll calls in contrast to the 1960 McAdoo speech volume data,® which 
showed a mean of —24.8 VU with a standard deviation of 7.3 dB for in- 
traoffice calls and a mean of —16.8 VU with a standard deviation of 6.4 
dB for toll calls. The standard deviation of the new speech data is con- 
siderably smaller than that of the McAdoo data. 

The crosstalk volume equations for LTNEXT and COFEXT, eqs. (3) 
and (6), involve the electrical volume at the telephone set line terminals 
of the disturbing talker, Vir. Presently, talker volume statistics at the 
telephone set line terminals are not available. To obtain the line-terminal 
talker volume statistics, as a function of loop length, from the central 
office statistics, the following expressions apply: 

My p(X) = (Nvco + ME} — E(x) (7) 
SVyuT = (S%e6 = sh) '/2, (8) 
where my,,,(x) and sy,,, denote the mean and standard deviation of the 
talker electrical volume at the telephone set line-terminals, myc, and 
SVco the mean and standard deviation of the talker volume measured 
at the central office, E1(x) the acoustic-to-electric transducer power 
loss,* as a function of loop length x, between the input acoustic pressure 
applied at the telephone set transmitter and the output voltage produced 
at the telephone set line terminals, and mg, and sz, the mean and 
standard deviation of the acoustic-to-electric transducer power loss 
between the acoustic pressure at the telephone set transmitter and the 
output voltage at the loop termination at the central office. 

In (7), the term in the braces translates the mean electrical talker 
volume at the central office, my¢o, into the mean acoustic pressure at 
the transmitter by adding the mean acoustic-to-electric power loss, mz,, 


* These transducer power losses are similar to, but different from, the EARS (Electro- 
Acoustic Rating System) losses discussed in Section 2.4.1: these power losses are fre- 
quency-weighted in a different manner than the EARS losses. 
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averaged over a representative population of loops of various lengths. 
This translation assumes that the talker acoustic pressure at the trans- 
mitter is not correlated with loop length. The subtraction of F(x), the 
acoustic-to-electric power loss at a given loop length x, from the term 
in the braces translates the mean acoustic pressure into the mean elec- 
trical speech volume at the line terminals for that specific loop length 
x. Figure 8 shows the mean electrical speech volume at the telephone 
set line terminals as a function of loop length, obtained by eq. (7) from 
the mean central office talker volume of —22.2 VU of intraoffice calls 
presented in Ref. 3. From (8), the standard deviation of the line-terminal 
talker volume is determined to be 3.9 dB. , 

Circuit noise received at the end of the intraoffice (loop-to-loop) 
connection is the power sum of three independent noises: (1) the far-end 
talker’s carbon transmitter noise (N1), attenuated by the losses of the 
two loops of the connection, (11) the noise of the far-end talker’s loop 
including the noise contributed by the central office (No), attenuated 
by the loss of the near-end loop, the disturbed listener’s loop, and (iii) 
the noise of the near-end loop (N3): 


N = (N; — Li — Lg) ® (N2 — Le) @ N3, (9) 


where L, and Lz denote the losses of the two loops in the connection and 


MEAN SPEECH LEVELIN VU AT THE TELEPHONE SET 
LINE-—TERMINALS 
| 
N 
(os) 
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LOOP LENGTH IN KFT 


Fig. 8—Mean electrical talker volume at the telephone set line terminals as a function 
of loop length, obtained from eq. (7) with the central office mean talker volume, ~22.2 VU, 
of intraoffice calls. 
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® represents the power sum operator.* The far-end talker’s carbon 
transmitter noise is assumed to have a constant value of 10.2 dBrnC.t 
The 1964 Loop Survey? shows that loop noise has little correlation with 
loop length. Based on the 1964 Loop Survey data, loop noise is assumed 
to be normally distributed with a mean of —1.1 dBrnC without the 
central office noise and a mean of 5.6 dBrnC with the central office noise. 
The standard deviation of loop noise is assumed to be 12.5 dB, both with 
and without the central office noise. By a Monte Carlo evaluation of eq. 
(9) with the above component noise statistics, the mean and the standard 
deviations of the total received noise of the intraoffice connection were 
determined as a function of the disturbed listener’s loop length. For 
example, for a loop-to-loop connection, with the length of both loops 
fixed at 7 kft, the mean and standard deviation of the received noise are 
determined to be 10.5 dBrnC and 8.5 dB, respectively. 


2.3 Crosstalk probability model 


The discussions hitherto have been concerned with the determination 
of crosstalk coupling losses, received crosstalk levels for potential 
crosstalk exposures, and received circuit noise. Whether or not a cus- 
tomer will actually receive intelligible crosstalk, however, is a random 
event. A mathematical model is developed in this section to evaluate the 
probability of hearing intelligible crosstalk on loops. 

For a customer to receive intelligible crosstalk, the following two 
conditions must be met simultaneously. First, a potential disturbing 
circuit must become active during the period when the customer under 
consideration is engaged in a telephone conversation. Given that the first 
condition has been met, exposing the customer to crosstalk, the second 
condition is that the received crosstalk level must exceed the disturbed 
customer’s intelligibility threshold in the presence of circuit noise. The 
probability that a customer on loop pair i will receive intelligible crosstalk 
from another loop pair, pair j, in the same cable, denoted by P;;, is ex- 
pressed by the following equation: 

P;; = Pripair j active/pair i active} X Pr{Vi; > T(N)}, (10) 
where V;; denotes the crosstalk volume on pair 1 received from pair J 
and T(N) denotes intelligibility threshold in the presence of circuit 
noise NV. 

The probability of activity coincidence between loops, the first 
probability in the right-hand side of (10), depends on the distributions 
of call holding time and quiet interval between calls on loops. This 
probability was determined in Ref. 5 for average busy-hour loop traffic 
to be 

Pr{pair j active/pair i active} = 0.17. (11) 


* A ® B= 10 logio(104/19 + 103/19), 
+L. M. Padula, Bell Laboratories, private communication. 
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The probability of crosstalk intelligibility, the second probability in 
the right-hand side of (10), depends on the distributions of crosstalk 
volume, circuit noise, and listener intelligibility threshold. The received 
crosstalk volume and circuit noise are determined by (3) through (6) and 
(9), with the distributions discussed in Section 2.2. Listener intelligibility 
threshold, which is determined by subjective tests, is defined quanti- 
tatively as the speech level at which a subject is just able to understand 
one or more words of the crosstalk content presented to him in the 
presence of masking noise.® 

Intelligibility threshold increases as a function of noise. When noise 
is relatively high, the increase in intelligibility threshold with noise is 
linear, that is, decibel for decibel. At low noise levels, the relationship 
between intelligibility threshold and noise is nonlinear: in this region 
of noise, as noise is decreased toward an infinitely small value, intellig- 
ibility threshold approaches a constant rather than continuously de- 
creasing, indicating a human ear’s absolute threshold independent of 
noise. This functional relationship between intelligibility threshold and 
noise can be expressed by the following equation in terms of a random 
variable independent of noise, 7'9, and a term varying nonlinearly with 
noise: 

T(N) = To + (N @ 12.8) VU, (12) 


where ® represents the power sum operator defined previously. The 
above equation is a mathematical expression of the intelligibility 
threshold data presented by T. K. Sen.? Sen’s data show that T, is nor- 
mally distributed with a mean of —95 VU and a standard deviation of 
2.5 dB for a crosstalk coupling mechanism with a flat frequency spec- 
trum. Sen also observed that the mean of Tp) should be lowered by 2 dB 
to —97 VU for a crosstalk coupling mechanism with coupling losses that 
roll off with frequency by 6 dB per octave. Since, as can be seen in 
Fig. 3, crosstalk coupling losses over the voice band have a 6-dB per oc- 
tave roll-off, To is assumed to have a mean of —97 VU and a standard 
deviation of 2.5 dB. 

Substituting (12) into (10), we have the following expression for the 
probability of crosstalk intelligibility: 


Pr{V;; > T(N)} = Pr{ Vi; —To—(N @ 12.3) > O}. (13) 


Because of the power sum, (N ® 12.3), analytical evaluation of the above 
equation is not possible even for normally distributed random variables. 
A simple but crude way of treating the term (N @ 12.3) would be to ap- 
proximate it with a normal variate. However, such an approximation will 
result in pessimistic results because the normality assumption allows 
an infinitely low value for the term when, in fact, the random term (N 
® 12.3) can never be smaller than 12.3. In this paper, therefore, the above 
probability is evaluated by a Monte Carlo method. 
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To apply a Monte Carlo method, the above equation is manipulated 
in the following manner. For a fixed value of noise, say N = nz, and as- 
suming normal distributions for other random variables, it can be shown 
that the crosstalk intelligibility probability is given in terms of the 
standardized normal cumulative distribution function © as follows: 


Pr{Vj; > T(N)/N = np} 
= Pr{Vi; — To — (nz ® 12.3) > 0} 


= {my;; —- NT) — (n, ® 12.3)} 
= k i Gi, Faye aE) | (14) 
where 
(a) =~ (* e-*2dx. 
Vx J-« 


The Monte Carlo evaluation procedure of (13) then consists of generating 
a sequence of random numbers according to the distribution of noise N 
and evaluating the following average:* 


{my;; —- MT) — (np ® 12.3)} 
(si, + 87)? 


where M is the number of random samples drawn for noise N, my,, and 
sy,,; are the mean and standard deviation of the received crosstalk volume 
determined for a given crosstalk path using (3) through (6), and m7, and 
S$, are the mean and standard deviation of the random variable To of 
(12). 

Using the last equation and the activity coincidence probability of (11) 
in (10), the probability that a customer on pair will receive intelligible 
crosstalk from pair j, P;;, is given by 


P;j = 0.17 X eq. (15). (16) 


1 k=M 
Pr{Vi; > T(N)} = M 2 ® | | (15) 


Finally, the crosstalk probability that the customer on pair i will receive 
intelligible crosstalk from any of the remaining N-1 pairs in the N-pair 
cable, denoted by P;, is given by, assuming small P;;: 


JEN 4c -d 
P; = > Piy,t=1,2,...,N,J Al. (17) 
j=l 


2.4 Effect of gain 


The received crosstalk volume and circuit noise equations, (3) through 
(6) and (9), assume no gain devices on loops, as is the case in the current 
loop plant. With the advancement of loop electronics, the present loop 


* Lapsa (Ref. 5) evaluated a similar probability by numerical evaluation of convolution 
integrals. 
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design rules may change in the future and require application of gain 
on loops. The effect of gain on crosstalk performance is discussed in this 
section for a particular example of gain application where the required 
gain is determined as a function of loop length to meet a certain constant 
loop loss. 


2.4,1 Loudness loss of telephone connections 


Voice communications over a telephone connection are accomplished 
by conversion of a talker’s acoustic pressure at the transmitter into an 
electrical signal, transmission of the electrical signal over a transmission 
medium to the receiving telephone set at the far end and reconversion 
of the received electrical signal into an acoustic pressure at the listener’s 
receiver. The loudness of the speech perceived by the listener depends 
on the magnitude of the talker’s acoustic pressure and the loss and fre- 
quency characteristics of the transmitter, the receiver and the trans- 
mission medium. The loudness loss between the input and output 
acoustic pressure of a connection is quantified by means of the 
Electro-Acoustic Rating System (EARS), and is referred to asthe EARS 
loss of the connection. For a complete and extensive discussion on the 
subject of EARS, the reader may refer to Ref. 9. 

For interoffice or toll connections, the transmission path consists of 
one or more trunks in tandem between the two end offices, which are in 
general derived on carrier facilities, plus a loop at each end. The EARS 
loss of such a connection is given by the sum of the transmit loop rating 
(TLR) of the talker’s loop (transmit loop) and the receive loop rating 
(RLR) of the listener’s loop (receive loop), plus the electrical loss of the 
intervening trunks. For intraoffice connections, the transmission path 
consists of two loops connected together at the central! office. The EARS 
loss of such a loop-to-loop connection is approximately the sum of the 
TLR and the RLR of the two loops. 

The TLR is defined in terms of an acoustic pressure spectrum specified 
by the EARS methods at the transmitter of a telephone set and the re- 
sulting EARS frequency-weighted, electrical voltage (EARS voltage) 
produced at the transmit loop termination at its central office. The RLR 
is defined in terms of an EARS voltage applied at the central office ter- 
mination of a receive loop and the resulting acoustic pressure produced 
at the telephone set receiver at the other end of the receive loop. The TLR 
and RLR have a unit analogous to decibels and are loss-like quantities 
in the sense that an algebraically larger TLR and RLR respectively cor- 
respond to a lower output EARS voltage at the central office and a lower 
output acoustic pressure at the receive telephone set. 
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Under the present loop design rules, both TLR and RLR vary with loop 
length, and consequently the EARS loss over the local portion* of a 
connection varies with the lengths of the two loops. A recent study! 
examined the possibility of providing a constant EARS loss for the local 
portions of all connections, regardless of loop length. Such a loss plan 
would permit EARS loss equalization of intraoffice (loop-to-loop) con- 
nections for all loop lengths, but would require changing loop design rules 
to allow for incorporation of gain. Since application of gain would raise 
the crosstalk level, the maximum amount of allowable gain may be 
limited by the consequent crosstalk performance degradation. 

The amount of gain required for loop EARS loss equalization depends 
primarily on three factors: (z) the constant EARS loss objective for local 
portions, (ii) allocation of the EARS loss objective to the TLR and RLR, 
and (iii) the present values of TLR and RLR, which are determined 
largely by the length of the loop. In this paper, the required gain is de- 
termined as a function of loop length to meet a constant TLR of —21 dB 
and RLR of 27 dB, regardless of loop length, which amount to a constant 
EARS loss of 6 dB for intraoffice (loop-to-loop) connections for all loop 
lengths. This constant EARS loss of 6 dB, allocated as —21 dB to TLR and 
27 dB to RLR, was examined as a possible alternative in the recent study 
mentioned previously!° to evaluate long-term loss plans for the loop 
plant. 

Presently, loops are designed according to the resistance-design rules! 
that control the electrical losses of loops by limiting loop resistance and 
requiring load coils when the length exceeds 18 kft. The resistance-design 
rules are applied with respect to the longest loop among the loops sharing 
a same cable, and thus the rest of the loops in the same cable would ex- 
hibit less loss. The longest loop, or the maximum-loss loop, in a cable 
assumed to conform to the resistance-design rules will be referred to as 
a theoretical resistance-design loop. 

The TLR and RLR are shown in Figs. 9 and 10 as a function of loop 
length for a theoretical resistance-design loop.t The constant TLR and 
RLR are indicated by dashed horizontal lines. The amount of gain re- 
quired to meet the constant TLR or RLR is then given by the difference 
between the horizontal line and the length-dependent curve. The re- 
quired gain is shown in Fig. 11 as a function of loop length. The required 
transmit and receive loop gains range approximately from —3 to 9 dB 
and from —1 to 4 dB, respectively. At short loop lengths, the required 
gain is negative, indicating that a loss, rather than a gain, is required for 
the loop loss equalization. 


* In this study, the local portion of a connection refers to that part of the connection 
which comprises the loop plus the telephone set at each end of the connection. 

+ The TLR and RLR shown in these figures were calculated with a computer program 
developed by F. B. Stallman, Bell Laboratories. 
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Fig. 9—Transmit loop rating (TLR) of the theoretical resistance-design loop and a 
constant loudness loss design loop. 


2.4.2 Effect of gain on crosstalk volume and noise 


For a given amount of gain, the effect on crosstalk performance de- 
pends on the location of its application. In this paper, we consider two 
possible locations: the telephone set and the central office. 

Referring to Fig. 12, which is the same as Fig. 7 except for the gain, the 
crosstalk volume equations, (3) through (6), are modified for gain applied 
at the telephone set as shown below: 


VLTNEXT; =Vyurt Gr, — LTNEXT;; + Gry (18) 
Vitrext, = Vco — Lg — LTFEXT;; + Gr, (19) 
VcONEXT; = Veo — Le - CONEXT}; + Gr, (20) 


VcOFEXT;; =Vyprt Gr, = Ly — COFEXT;; — Le + Gr,. (21) 
The received noise equation, (9), is modified as follows: 
N= (N, + Gr, — Ly — Lo + Gp,) 
®@ (No — Lot Gr.) ® (N3 + Gr). (22) 
Referring to Fig. 13, gain applied at the central office will not affect 
LTNEXT but will affect LTFEXT, CONEXT, and COFEXT. Since the latter 
three types of crosstalk exposures are in general less significant than the 


first, the effect of the gain is less pronounced when applied at the central 
office than at the telephone set. However, depending on loop length, the 
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Fig. 10—Receive loop rating (RLR) of the theoretical resistance-design loop and a con- 
stant loudness loss design loop. 
amount of required gain might be sufficiently large to make these 
crosstalk exposures significant. The following equations give crosstalk 
volumes and circuit noise when gain is applied at the central office: 


VLTNEXT, = Same as eq. (3) (23) 

Virrext, = Vco + Gr, — Le — LTFEXT; (24) 
Vconext; = Vco + Gr, — CONEXT;; + Gr, + Gr,—Le (25) 
Voorext, = Vir — Li — COFEXT; + Gr, + Gr, — Le (26) 


N= (Ni, —-—L,+ Gr, + Gr, — Le) 
® (No + Gr, + Gro — Le) @® Ns (27) 


lll. RESULTS 


The loop crosstalk probabilities were determined first for theoretical 
resistance-design loops and then for the 1100 loops sampled in the 1964 
Loop Survey.? In each case, the crosstalk probabilities were determined 
both without gain and with gain. In the case of loops with gain, two 
possible locations of gain application were evaluated: the telephone set 
and the central office. Sections 3.1 and 3.2 present the crosstalk proba- 
bilities determined for the theoretical resistance-design loops and the 
actual loops, respectively. 
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Fig. 11—The required gain on the transmit and receive loop derived from the curves 
of Figs. 9 and 10. 
3.1 Theoretical resistance-design loop 


As discussed in Section 2.3, the crosstalk probability for a loop is ob- 
tained by summing the crosstalk probabilities between that loop and 
the rest of the loops in the same cable, considering the four potential 
crosstalk exposures shown in Fig. 7: LTNEXT, LTFEXT, CONEXT, and 
COFEXT. The crosstalk probability for loop pair i, P;, for example, is 
obtained first by determining the probability P;; for all j, 7 # i, by eq. 
(16) in connection with eq. (3) through (6) for the four crosstalk expo- 
sures and then summing P;; over j, as expressed by eq. (17). The crosstalk 
probability P; so determined for loop pair i will be referred to as the total 
crosstalk probability of the pair, and represents the probability of re- 
ceiving intelligible crosstalk on that loop from any of the remaining loops 
in the cable through any of the four possible crosstalk exposures. 

Table I presents the total crosstalk probabilities calculated for each 
of the 25 loops of the 25-pair cable used in the cable crosstalk coupling 
model, with intraoffice type disturbing and disturbed connections. The 
crosstalk probability with the toll type disturbing connection was almost 
the same as that with the intraoffice type disturbing connection because, 
as discussed in Section 2.2, there was very little difference between the 
intraoffice and toll talker volume statistics.? All probabilities discussed 
hereafter are the probabilities with the intraoffice type disturbing con- 
nection. 

For the particular results shown in Table I, the two loops of the dis- 
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Fig. 12—Application of gain at the telephone set. (a) NEXT. (b) FEXT. 


turbed connections were both assumed to be 7 kft, which was estimated 
to be a typical length of the Bell System loops, based on the 1964 Loop 
Survey.? The loop length dependence of the crosstalk probability is 
discussed later. As can be seen in this table, there is a wide difference 
in crosstalk probability between pairs in a cable: the highest crosstalk 
probability is 3.19 X 10-4 percent (pair 18), the median probability, 1.45 
X 10~* percent (pair 14), and the smallest probability, 1.15 X 10-® per- 
cent (pair 25). 

The crosstalk probability of the worst loop, pair 18, was evaluated as 
a function of the disturbed customer’s loop length as presented in Fig. 
14. Unlike the disturbed customer’s loop, which is permanently assigned 
to the customer, the other loop of the disturbed connection occurs ran- 
domly, depending on the called party. The length of this latter loop was 
fixed at 7 kft, the representative length mentioned previously. The 
dashed curves show the crosstalk probabilities for the four exposures, 
LTNEXT, LTFEXT, CONEXT, and COFEXT, and the solid curve shows 
the total crosstalk probability, the sum of the four probabilities. As can 
be seen, the probability of LTNEXT is dominant at all loop lengths except 
at lengths less than about 2 kft at which the probability of COFEXT is 
dominant. 
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Fig. 13—Application of gain at the central office. (a) NEXT. (b) FEXT. 


Since LTNEXT is the dominant crosstalk, the pattern of variation with 
loop length of the total crosstalk probability in Fig. 14 is determined by 
the pattern of the LTNEXT probability variation. The behavior of the 
LTNEXT probability with loop length can be explained by considering 
the corresponding crosstalk volume equation (3). As can be seen from 
Figs. 5 and 8, both NEXT coupling loss and line-terminal electrical speech 
level decrease with increasing loop length. At short loop lengths, since 
NEXT coupling loss decreases with loop length much faster than dis- 
turbing speech volume, the received crosstalk volume of LTNEXT, and 
consequently the LTNEXT probability, increases with loop length. As 
loop length is increased further, however, NEXT coupling loss approaches 
a saturation, that is, the length translation factor given in Fig. 5 does not 
change, whereas disturbing speech volume still decreases steadily with 
loop length. Therefore, the received crosstalk volume for LTNEXT, and 
consequently the LTNEXT probability, decreases as loop length is in- 
creased beyond a certain point; in this case, about 9 kft. 

According to the resistance-design rules,! a cable is loaded when its 
length exceeds 18 kft. As discussed in Section 2.1, a loaded cable is as- 
sumed to have a NEXT coupling loss 3 dB less than a nonloaded cable. 


3022 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1978 


Table | — The total crosstalk probabilities of the 25 loops of the 
25-pair, 26-gauge, nonloaded PIC cable, obtained by treating each 
loop as a 7 kft, theoretical resistance-design loop engaged in an 
intraoffice (loop-to-loop) connection 


Rank Pair No. Crosstalk Probability (%) 
1 18 3.19 X 1074 
2 8 2.82 X 1074 
3 10 2.21 X 1074 
4 4 1.93 X 10-4 
5 7 1.43 X 1074 
6 19 1.23 X 1074 
7 5 1.22 x 10-4 
8 20 1.20 X 10-4 
9 24 6.29 X 1075 

10 22 4.54 X 1075 
11 11 4.02 X 107-5 
12 2 3.69 X 1075 
13 14 1.45 X 1075 
14 15 1.29 xX 10-5 
15 13 1.02 X 1075 
16 12 1.00 X 1075 
17 9 4.41 X 1076 
18 23 3.64 X 1076 
19 6 3.06 X 1076 
20 17 2.60 X 10-6 
21 16 2.45 X 1076 
22 1 2.01 X 10-6 
23 21 1.96 X 1076 
24 3 1.33 X 1076 
25 25 1.15 x 10-6 


The sudden increase in the LTNEXT probability at 18 kft is due to the 
3-dB drop in NEXT coupling loss with loading. At loop lengths greater 
than 18 kft, both disturbing talker’s electrical signal level and NEXT 
coupling loss are fairly constant with loop length, and the LTNEXT 
probability does not change much with loop length. 

The effect of gain on the crosstalk probability of the theoretical re- 
sistance-design loop is shown in Fig. 15. The solid curve is the total 
crosstalk probability without gain, the same curve as that shown in Fig. 
14, and the two dashed curves are the total crosstalk probability with 
gain at the telephone set and at the central office, respectively. Without 
gain, the total crosstalk probability of the theoretical resistance-design 
loop does not exceed 0.002 percent at all loop lengths. Gain applied at 
the central office shows very little effect on the crosstalk probability. This 
is because gain applied at the central office does not affect LTNEXT, the 
dominant crosstalk, as shown by eq. (23). However, with gain applied 
at the telephone set line terminals, the total crosstalk probability of the 
theoretical resistance-design loop can increase up to as much as 0.5 
percent, depending on loop length. 

Currently, no crosstalk objectives exist for loops. However, for plan- 
ning purposes, a crosstalk probability of 0.1 percent has generally been 
used in the past as a limit for satisfactory loop crosstalk performance. 
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Fig. 14—Crosstalk probabilities of the theoretical resistance-design loop, without gain, 
evaluated for the worst pair (pair 18) of the 25-pair, 26-gauge PIC cable. 


In comparison with this limit, the crosstalk performance of the present 
resistance-design loops is, from Fig. 15, more than satisfactory. 

For the particular example of gain application considered in this paper, 
gain applied at the telephone set can cause a significant degradation in 
loop crosstalk performance, depending on loop length. To relate the 
increase in crosstalk probability to the amount of gain applied, one may 
compare Figs. 11 and 15. Figure 15 shows that, with gain applied at the 
telephone set, the crosstalk probability exceeds the 0.1-percent level, 
the limit mentioned previously, at about 12 kft of loop length. From Fig. 
11, one may find that the required gain assumed at this length is 6 dB 
for the transmit loop and 2 dB for the receive loop, which amounts toa 
total gain of 8 dB on a crosstalk path. The maximum allowable telephone 
set gains at other loop lengths and for other values of permitted crosstalk 
probability can be determined similarly. 

With gain applied at the central office, the crosstalk performance of 
the theoretical resistance-design loop still remains well below the level 
of 0.1-percent crosstalk probability, for the entire range of gain consid- 
ered, where the maximum transmit and receive loop gains were about 
9 and 4 dB. Ina similar evaluation made previously, Lapsa® concluded 
that 9 dB of gain applied at the central office would be excessive. Because 
of the differences in the methodology as well as in the coupling loss and 
speech volume data used in the evaluation, a direct comparison between 
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Fig. 15—The effect of gain on the crosstalk probability of the theoretical resistance- 
design loop, evaluated for the worst pair (pair 18) of the 25-pair, 26-gauge PIC cable. 


the previous results and the present results is not possible. Nevertheless, 
the present results on the effect of the central office gain are, in general, 
somewhat optimistic in comparison with the previous results because 
of, among other things, the use of more recent coupling loss and speech 
volume data in the present study.* 


3.2 The 1964 survey loops 


The 1964 Loop Survey results? provide such information as length and 
loading conditions on 1100 loops sampled in the plant. Using this in- 
formation, the crosstalk probabilities were calculated for the 1100 sample 
loops, first without gain and then with gain assumed either at the tele- 
phone set or at the central office. Each loop was treated as though it was 
the worst pair in a cable, such as pair 18 of Table I. This worst-case 
evaluation was made because, due to the permanent assignment of a loop 
to a customer, poor crosstalk performance would focus on a single cus- 
tomer rather than being distributed among many customers. 

The total crosstalk probabilities calculated for the 1100 sample loops 
are shown in Fig. 16 as a scatter plot, where the abscissa is the length and 


* The more recent coupling loss data used in the present study show better crosstalk 
performance than the coupling loss data used in the previous study. As discussed in Section 
2.2, the speech volume data used in the present study show a much smaller standard de- 
viation than the McAdoo data used in the previous study, the smaller speech volume 
variability yielding a smaller crosstalk probability. 
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Fig. 16—Scatter plot of the total crosstalk probabilities of the 1964 survey loops, without 
gain, obtained by assuming that each loop was the worst pair in a cable. 


the ordinate the crosstalk probability. For comparison, the total crosstalk 
probability of the theoretical resistance-design loop is superimposed as 
a solid curve, which is the same curve as that shown in Fig. 14. The cu- 
mulative distribution functions (CDFs) of the crosstalk probabilities of 
the 1100 sample loops without gain are presented in Fig. 17, where the 
solid curve shows the CDF of the total crosstalk probability and the 
dashed curves show the CDFs of the LTNEXT, LTFEXT, CONEXT, and 
COFEXT probability. 

The effect of gain on the crosstalk performance of the sample loops 
was evaluated with the required gain determined by the difference be- 
tween the constant TLR of —21 dB and RLR of 27 dB mentioned previ- 
ously and the actual TLR and RLR, which were calculated from the in- 
formation provided by the 1964 Loop Survey. The results are compared 
with the crosstalk probability determined for the present plant (loops 
without gain) in Fig. 18. The solid curve is the CDF of the crosstalk 
probability of the sample loops without gain (the same curve as that 
shown in Fig. 17) and the two dashed curves show the CDFs of the 
crosstalk probabilities with gain applied at the telephone set and at the 
central office, respectively. 

Without gain, the total crosstalk probability is less than 0.01 percent 
for all the sample loops; the median is 3 X 10-4 percent. This indicates 
that the crosstalk performance of the present loop plant is more than 
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Fig. 17—Cumulative distribution functions of the crosstalk probabilities of the 1964 
ated loops, without gain, obtained by assuming that each loop was the worst pair in a 
cable. 


satisfactory in comparison with the 0.1 percent crosstalk probability 
limit. Gain applied at the central office shows only a small effect on the 
distribution of the loop crosstalk probabilities in the plant. However, 
gain applied at the telephone set changes the distribution of the loop 
crosstalk probabilities significantly, increasing the crosstalk probability 
above the 0.1-percent level on about 15 percent of the sample loops 
evaluated. 

IV. SUMMARY OF THE RESULTS 


The intelligible crosstalk probability is defined as the probability that 
a customer will hear one or more intelligible crosstalk words during a call. 
The intelligible crosstalk probability for a loop is obtained by summing 
the probabilities of intelligible crosstalk between that loop and the rest 
of the loops in the same cable, considering the four potential crosstalk 
exposures shown in Fig. 7. Using the methodology developed in Section 
II, the crosstalk probabilities have been calculated first for theoretical 
maximum-loss resistance-design loops! as a function of loop length and 
then for the 1100 loops of various lengths sampled from the loop plant 
in the 1964 Loop Survey.? 

The crosstalk probabilities were obtained first for loops as they exist 
in the present plant, that is, loops without gain. The effect of gain devices 
on the loop crosstalk probabilities was then evaluated for a particular 
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Fig. 18—The effect of gain on the distribution of the total crosstalk probabilities of the 
1964 survey loops, obtained by assuming that each loop was the worst pair in a cable. 


example of gain application. In this example, the assumed gain was de- 
termined as a function of loop length to meet a constant TLR (Transmit 
Loop Rating) of —21 dB and RLR (Receive Loop Rating) of 27 dB, re- 
gardless of loop length, which would equalize the EARS (Electro-Acoustic 
Rating System)* loss of intraoffice (loop-to-loop) connections at a 
constant value of 6 dB. For this particular example, gain required for 
a loop in its transmit direction and receive direction ranged roughly from 
—3 to 9 dB and from —1 to 4 dB, respectively. Two possible locations of 
gain application were evaluated: the central office and the telephone 
set. 

Table I shows rank-ordered crosstalk probabilities of the 25 theoretical 
resistance-design loops without gain in a 25-pair cable, determined with 
loop length fixed at 7 kft, a representative length of Bell System loops. 
Figure 15 presents the crosstalk probability of the worst of the 25 loops 
(pair 18 in Table I) as a function of loop length for the three different 
cases: loops without gain (the present plant), loops with gain at the 
central office, and loops with gain at the telephone set. Figure 18 presents 
the cumulative distribution functions (CDFs) of the crosstalk probabil- 
ities of the 1100 sample loops obtained by treating each sample loop as 
the worst loop in a cable (such as pair 18 of Table I). 

Presently, no crosstalk objectives exist for loops. For planning pur- 


* See Section 2.4.1 of this paper and Ref. 9 for the discussion of EARS, TLR, and RLR. 
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poses, however, a crosstalk probability of 0.1 percent has generally been 
used as a limit for satisfactory loop crosstalk performance. In comparison 
with this limit, the crosstalk performance of the present loop plant (loops 
without gain) is more than satisfactory, as can be seen in Fig. 18. With 
gain at the central office, the crosstalk probability still remains well below 
the 0.1-percent level for all the sample loops, and thus gain applied at 
the central office does not appear to have any significant effect on loop 
crosstalk performance for the entire range of gain considered. However, 
with gain applied at the telephone set, the.crosstalk probability exceeds 
the 0.1-percent level on about 15 percent of the loops evaluated. These 
results indicate that, for the particular example of gain application 
considered in this paper, gain applied at the telephone set may cause a 
significant crosstalk performance degradation. 
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On the Stability of Interconnected Systems* 
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Theorems are presented concerning conditions for the input-output 
stability of interconnected dynamical systems. Results in the area of 
input-output stability are often partitioned into two categories: 
small-gain type-results and passivity-type results. The main theorem 
given here does not fall into either of these categories, but is most closely 
related to the passivity-type results. The theorem involves a new class 
of interconnection operators that is a substantial generalization of the 
familiar set of nonnegative operators defined on a space of vector- 
valued functions. 


Il. INTRODUCTION 


In this paper, theorems are presented concerning the input-output 
stability of interconnected systems.* Results in the area of input-output 
stability are often partitioned into two categories: small-gain type results 
and passivity-type results. The main theorem given here does not fall 
into either of these categories, but is most closely related to the passiv- 
ity-type results. The theorem involves a new class of interconnection 
operators that is a substantial generalization of the familiar set of non- 
negative operators defined on a space of vector-valued functions. 

The mathematical model considered throughout the paper is described 
in Section II, and results of a general nature concerning the model are 
given in Section III. The case in which the interconnection operator has 
a certain matrix representation is treated in considerable detail in Sec- 
tion IV. In Section 4.5, a specific example is given of a stable intercon- 
nected system for which the interconnection matrix does not meet the 
nonnegative-definiteness requirement of the criterion given in Ref. 7, 
which contains the most pertinent earlier stability result. 


* This paper was presented at the 1978 IEEE Symposium on Circuits and Systems (New 
York, May 17-19, 1978). 

+ For background material in book form concerning intput-output stability, see, for ex- 
ample, Refs. 1-4. Interconnected systems (which are systems whose natural or artificial 
decomposition into subsystems plays a prominent role in their mathematical analysis) 
have been considered by many researchers. See, for example, Refs. 3, 5, 6, and 7. Although 
some interesting and significant results have been obtained concerning the stability of 
interconnected systems, the theory is very much in its initial stages of development. 
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The main purpose of the paper is to introduce a new concept that is 
believed to be useful. No attempt is made to present the sharpest possible 
stability results that the concept can be used to obtain. 


I. THE MODEL 
2.1 Preliminaries 


Let K denote a real linear space that contains a normed linear inner- 
product space L with inner product (-,-) and norm | -| related by | f| = 
(f,f)'”2 for f € L. (Of particular interest to us is the case in which L is the 
set Lo of all real Lebesgue square-integrable functions defined on the 
half line [0,~) with the usual inner product, and K is the “extended” set 
Eg of real functions defined on [0,~) such that each function is square 
integrable on [0,7] for any nonnegative number 7.) 

For each 7 2 0, let P, denote a linear mapping of K ints L (e.g., if K 
= Eo, let P, be defined ie (P,f)(t) = f(t) for t € [0,7] and f(t) = 0 fort 
> 7, where f is an arbitrary element of Eg). 

Let K, L, and P, be such that (i) g € L if and only if g € K and sup, 
|P.g| < , (ii) |g| = sup, |P,g| for g € L, and (iii) (P,f,g) = (P-f,P.8) 
and |P.f| < |f| for f and g in L andr = 0. 

We let L” and K", in which n is any positive integer, denote the n-fold 
Cartesian product of L and K, respectively. The norm of an element h 
= (hy,hog,...,hn) of L” is denoted by |h| and is defined by |h| = 
|; |2)2/2. 

It is assumed that L contains n elements e},€9, . . . ,e¢, such that |e; | 
= 1 for each i and (e;,e;) = 0 fori # j.* 

We say that an operator T that maps K into itself (i.e., an operator 
T in K) is causal if and only if P,T = P,TP, on K for all 7 = 0. 


2.2 The basic equations 


Throughout the paper, attention is focused on.an interconnected 
system governed by 


n 
xi + ps A;Bjx7= yi, t= 1,2,...,n, (1) 
J=1 


in which (A.1): x; and y; belong to K for allz, and A;; and B; are causal 
operators in K for alli and j. 

In (1), each B; is associated (sometimes somewhat indirectly) with a 
subsystem, and the A;; ordinarily take into account the way in which the 
subsystems interact. Typically, it is not difficult to show the existence 
of a solution x1, X9,...,Xp of (1) for any given yj, yo, ... ,¥n under some 
weak additional hypotheses. (Successive-approximation type arguments 
of the kind commonly used in connection with nonlinear Volterra inte- 
gral equations often suffice.) 


* This assumption is used only in Section IV. 
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We assume that (A.2): each B; is nonnegative in L, in the sense that 
each B; maps L into L and there exists a nonnegative constant a such 
that 


(Bjyw,w) 2 alw|? (2) 


for w € L and all j. It is assumed also that (A.3): each A;; maps L into 
itself, and there is a positive constant y such that 


|Ajw| S y|w| (3) 


for w € L and alli and j.* 
It is often convenient to write (1) in the form 


x + ABx = y, (4) 


in which x = (x1,%9,...,%n), Y = (V1,¥9,---,¥n), and A and B are the 
mappings of K” into K” defined by (Af); = 2;Aj;f; and (Bf); = Bif; for 
all f © K” and each i. 


2.3 Definition of stability 


We say that (4) is L-stable if and only if y € L” implies that x € L” 
with |x| < p(|y]) for some nonnegative continuous function p that de- 
pends only on A and B and is defined on the nonnegative reals such that 
p(0) = 0. 


Ill. Sg AND THE MAIN THEOREM 


In the following definition and hypothesis, 8 is a nonnegative num- 
ber. 
Definition of Sg: Sg is the set of operators H in L” with the following 
property: For each v € L” such that |v| ~ 0, there is an index k such 
that |v, | # 0 and (vz,(Hv),) = Blvz| 2. 

The definition of Sg is related to one of the equivalent definitions of 
a ‘““P-matrix.” 9 
H.1: If 8 = 0, there is a positive constant 6 such that 


|Bjw| < 6|w| (5) 


for w € L and all j. 

Our main result is the following. 
Theorem 1: Let H.1 (and A.1 through A.3) hold. Then (4) is L-stable 
ifA € Sgwithat B>0. 


* In order to focus attention only on essentials, we are proceeding with some assumptions 
that are stronger than necessary. It will become clear that (2) and (8) (and also (5) of Section 
III) could have been replaced with somewhat weaker inequalities (similar, for example, 
to some of those used in Section 5.3 of Ref. 8). Similarly, if for example there are positive 
constants a; such that (Bjw,w) = a;|w|? for w € L and all j, and if A in (4) is represented 
by an nXn matrix in the sense of Section IV with [J + a diag (a;)] invertible, then it is clear 
that x satisfies an equation similar to (4) in which each B;x; is replaced with (Bjx; — a,x;) 
and A and y are modified accordingly. Consideration of such a modified equation often 
enables a useful trade-off to be made between requirements on A and the degree of posi- 
tiveness of the B;. 
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Proof of Theorem 1: Let H.1 and A.1 through A.3 be satisfied, let y © 
L", let x © K” be a solution of (4), and assume that A € Sg with 
at+Bp>0. 

Suppose first that 6 = 0, in which case a > 0. Let r = 0 be arbitrary. 
Using (1) and the casuality of the A;;, we have 


Px: +P, AjjPBjx; ad Py; (6) 
J 
and 


(P,BixiP.x;) + (P,BixyP, ¥ AyP-Bjxj) = (P:BixiP.0) (7) 
J 


for i= | eee 5 Since A E Ss, and (P,B;x;,P, 2jAijP,B;x;) = 
(P,Bix;,2;A;jP,B;x;) for each i, there is an index k such that 
(P,Brpx,,P,2j;ApjP,Bjx;) 2 0 and hence such that 


(P,Brxp,P xz) S (P,;Bpxp,P yp). (8) 


By the Schwarz inequality and the fact that (P,Bpx,,Pryp) = 
(P,BrP,xp,PrYp) = (BrP Xp,P Yn), (P-BrxpPvr) S |BrP xx| + |Pryel. 
Therefore, using (P,Bpxp,P,x,) = (BrP,xp,P;Xp) as well as (2), (5) and 
(8), we have 


a|P,x,|? < 6|P,x,| - |P-yx| (9) 
and consequently, with c = a7}, 
|P,xp| = c|P,ye|. 


The argument given above shows that |P,x;,| < c|y| for some k (which 
might depend on x and 7). Let J denote any nonempty proper subset 
of {1,2,... ,n} with the following property. For j € J, there is a constant 
c; that depends only on c, 6, and y such that |P,x;| < cj|y|. Using 
(1), 


Xi + x A, Bx; Sg >» A; B;x;, Ll & J. (10) 
i€JT JES 


The left side of (10) is basically the same in form as the left side of (1). 
With r the number of elements contained in J, let the elements of 
({1,2,... ,n}—J) bejiJo, --- J(n—r) ordered so that jy <jo<+*+<J(n-p). 
With respect to that ordering, let the mapping of K‘"—”) into itself as- 
sociated with (10) that corresponds to A be denoted by A,. Since A.3 
holds, each A;; maps the zero element of L into itself, and it is a simple 
matter to verify that A, belongs to, so to speak, Sg with n replaced with 
(n —r). Thus, by the argument given above, we find that there is an index 
1 € J such that 


|P,x;| Sc E (11) 





P, (91 — X AyBjx;) 
JET 
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Using |P,x;| < cj|y| for j € J, as well as (11) and the causality of the 
Ajj and the B;, we have 


IP.x)| <c (\y1 if Pe P,AyB;x; 
JEd 





<c (191 + oe |P,AyP,B;P_)|) 
jEdT 
Sc (191 +76 > |P,x;l) 
jEd 


<c e yo > cj) ly|. 
JES 
Let w1,w9,...,w, be defined by w; = c and 
(1) 
wi =e (1470 > wi), i = 2,3,...,n. 
j=l 


We have shown that given x, 7, and i, |P,x;| < d;|y| for some d; € 
{wy,W9,...,Wn}. Since w, = max;w;, we have 


n 
> |P,x;|2 S nw,2|y|2 for all z 2 0, 
i=l 


which shows that x € L” and that |x| is suitably bounded in terms of 
|y|. This completes the proof for the case in which 6 = 0. 

The proof for the 8 > O’case is similar. Using primarily (7) and the 
hypothesis that A € Sz, we find that 


B|P,Brxr|? < |P,Brxe|-|P-ye| 
for some k. Therefore, 
|P;Brxz| < B|y| (12) 


for some k. By proceeding essentially as indicated above, we can show 
that 


|P,B;x;| < Qn|y| for alli and z 2 0, (13) 
in which Q,, is the number defined by Q; = 67! and 
(i-1) 
Q; = B-} (uy Dw 9) 
j=l 


fori = 2,3,...,n. 
From (6) and (13), 


Px s |P-y;| fe 2 |A;;P,B;x;| 
J 
< ly] +O y2nly! 
J 
<(1+nyQ,,)ly| 
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for each i and all 7 2 0. Since this shows that x € L and that |x| is 
bounded as required, our proof is complete. 


3.1 Comments 


To see that U,>0 S,, contains the familiar set of nonnegative operators 
defined on a space of vector-valued functions, let H be any mapping of 
L” into itself such that 


¥ (w;,(Hw);) = olw|2, w EL, (13) 
i=1 


in which o is a nonnegative constant. From (13) it is clear that for each 
w € L” with |w| =~ 0 there is an index k such that |w,| ~ 0 and 
(w;,(Hw),) = B|w|? in which 8 = on—!. Since |w|? = |w;|2, we observe 
that H € Sz. 

Theorem 1 is of course a result concerning the L-boundedness of so- 

lutions of (4).* By modifying the hypotheses and proof of Theorem 1 
in a direct manner, an analogous result can be obtained concerning the 
L-continuity of solutions (i.e., concerning the L-boundedness of the 
difference (xz — x,) of a solution x, of (4) that corresponds to y = y, and 
a solution x, that corresponds to y = yz with (yg — y,) € L). With regard 
to the necessary modifications of the hypotheses concerning A, the fol- 
lowing definition, in which 8 = 0, plays a central role. 
Definition of Tg: Tg is the set of operators H in L” with the following 
property: For each u and v in L” such that |u — v| # 0, there is an index 
k such that Ju, — vz,| 4 0 and (uz — vp, (Hu), — (Hv),) = Blu, - 
Vk | 2, 

In order to be more explicit, let (A.1’) denote the assumption that x, 
+ ABxg = yq and x, + ABx, = yp in which each Aj; and B; are causal 
operators in K and xq, Xp, Ya, and yp belong to K. Let A.2’ be the hy- 
pothesis obtained from A.2 by replacing “(Bjw,w) = a|w|? for w € L 
and all j” with “(Bju — Bjv,u — v) = alu — v|? for u and v in L and all 
J,” and let A.3’ and H.1’ be the hypotheses obtained in a similar manner 
from A.3 and H.1, respectively. 

Our L-continuity result (whose proof is omitted) is the following. 
Theorem 2: Let H.1’ and A.1’ through A.3’ be satisfied, let (yg — yy) € 
L, and let A € Tgwith a+ B > 0. Then (xq — xy) € L, and there isa 
nonnegative continuous function p defined on [0,~) that depends only 
on Aand B such that p(0) = Oand |xq — xo| S p(|ya — yo|). 


* Results along the lines of Theorem 1 for cases in which B is more general than assumed 
here but both A and B are nonnegative operators are given in Ref. 8, where the stability 
of interconnected systems in the sense of Section 2.2 is not explicitly discussed. A non- 
negative-operator approach to the stability of interconnected systems, as well as its relation 
to other approaches, is discussed in Ref. 7. 
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3.2 A Corollary to Theorem 1 


We shall refer to the following two hypotheses. 
H.2: Each B; is a continuous mapping in L that maps the zero element 
into itself, and there are positive constants c; and c2 such that for 
all 1 


(Bju,u) = c1|Byu|? (14) 
|Byu — Byv| < celu — v| (15) 


for u and v in L. 
H.3: K = Eo, L = Lo, and for each rt = 0 P, is the operator associated with 
E> in the example given in Section 2.1. 

There are many cases in which (14) holds* for all u for some c, > 0, 
but there is no positive a such that (2) is satisfied for all w.t On the other 
hand, it is clear that there is a positive c, with the property that (14) is 
met when (2) holds with a > 0 for all w and there is a positive constant 
6 such that |B;w| < 6|w| for all w. 

Corollary 1: Let H.2 and H.3 (as well as A.1 and A.3) be satisfied. If A 
€ So, then (4) is L-stable. 

Proof: Assume that the hypotheses of the corollary are satisfied and let 
I and I, respectively, denote the identity operators in K and K”. With 
regard to the following lemma, two elements u and v of E2 are taken to 
be the same if and only if |P,(u — v)| = 0 for all 7 2 0. 

Lemma 1: Let H.3 hold, let F be a continuous mapping of Lz into itself 
such that for some positive constant c < 1 we have 


|Fu —- Fvo| <c|u-—v| foruandvinL», (16) 


and let F also be a causal mapping of Ey, into E>. Then (I — F)—! exists 
and is causal on both Ly and Eo. 

Proof of Lemma 1: Let the hypotheses of the lemma be met. In view of 
(16) and the continuity of F, the equation x — Fx = h withh € Lo has 
in Ly a unique solution x which is given by x = lim,-... x ™ in which x 
=h+Fx"-1forn = 1andx© =h. Thus, (J — F)~! exists on Lo, and 
since h + F(-) is causal on Lo so is UJ — F)71. 

Now let h € Eo, and for each 7 = 0 let z, be the unique element of Lo 
that satisfies z, — Fz, = P,h. Since (J — F)—! is causal on Lg, it is clear 
that P,,2,, = P,,27, for t2 2 71. Let x be the element of E2 defined by the 
condition that P,x = P,z, for all 7 = 0. For any 7 2 0, P;x — P,Fx = P,2, 
— P,FP,x = P,z,—P,FP,z,= P,z, —P,Fz,=P,h. Therefore x satisfies 
x — Fx = h. Suppose that x; and x2 in E¢ satisfy h = x1 — Fx; =xo— Fxe 


* This type of inequality is among those used in Ref. 8. 

1 We mention two simple examples: Let L = Le and let B; be defined by the condition 
that for each t = 0, (B;w)(t) = w(t) for |w(t)| < 1 and (B,w)(t) = sgn(w(t)) for |w(t)| > 
1. Then (14) with c, = 1 holds for all u € Lo, but there is no a > 0 for which (2) is satisfied 
for all w € Lo. It is not difficult to show that a similar conclusion is reached when B; is the 
convolution operator in L2 with impulse response e~*. 
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with |P,(x; — x2)| + 0 for some 7 = 0. Since F is causal, we have Ph 
= Phe in which h, = P,x,; — FP,x; and hy = P,;x»_ — FP,x». This con- 
tradicts the fact that (I — F)—! is causal on Ly. Thus, x is the unique 
solution in E> of x — Fx = h, which means that (J — F)~! exists on Eo, 
because h is arbitrary. In view of the fact that the solution x of x — Fx 
= h satisfies P,x = P,z, where z, — Fz, = P,h for every r = 0, it is evi- 
dent that the operator (J — F)-! on Eg is causal. This proves the 
lemma. 

Let c be a positive constant such that c < min (cj,c9~!). By Lemma 
1, 7, — cB)—! exists on K” and B(I,, — cB)—! is causal on K” and maps 
L” into itself. In particular, the equation x + ABx = y can be written 
as 


h+(A+clI,)BU, — cB)~th = y 


in which h = x — cBx. From A € So, it follows at once that (A + clI,) 
E Sz. 

Also, from h = x — cBx and the fact that B is causal on K” and satisfies: 
|[Bu| <celu| foru € L", with cco < 1, we have |P,x| < (1 — cc) |P,h| 
for 7 = 0. 

Therefore, by Theorem 1, to complete the proof it suffices to observe 
that for any w € L", 


(BU, — cB)—1w,w) = (Bu,u — cBu) 
= (Bu,u) — c|Bu|? 
>0 


in which of course u = (,, — cB)71w. 


IV. RESULTS CONCERNING THE MATRIX CASE 


Of importance in the theory of interconnected systems is the special 
case in which A is represented by a real n Xn matrix a with elements a;;, 
in the sense that for each 1, 


n 
(Aw); = > a;jWj, W EC LL", 
j=l 


Throughout this section, “A € M” means that A has such a represen- 
tation with representation matrix a, and, assuming that A € M, U,(U) 
denotes the set of representation matrices such that A € So (A € Sg 
with 6 > 0). In addition, Po(P) denotes the set of real square matrices 
with nonnegative (positive) principal minors. 

Proposition 1: If (A.1 through A.3 are satisfied and) AC Muitha € 
Po, then (4) is not L-stable for some B. 

Proof: Let the hypotheses be met, and let 1,, denote the identity matrix 
of order n. Froma € Po, it follows that there is a diagonal matrix d = 
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diag (d,,do,...,d,) with d; > 0 for all i such that (d + a) and hence 
(1, + ad!) are singular.* 

For each 1, let B; be defined by the condition that B;w = d;~!w for w 
€ K. Let v be any real nonzero n-vector that is annihilated by (1, + 
ad~'!), and let e be any element of K different from the zero element 0 
of L. With x = ve, we have x + ABx = 6. This shows that (4) is not L- 
stable for the particular B constructed. 

In order to proceed to the first theorem in this section, it is necessary 
to introduce the following definitions. For each w € L", a, denotes the 
matrix obtained from a by replacing a;; with a;;(w;,w;) for all: and j. For 
anyw € L" with |w| ~ 0, I'(a,,) denotes the matrix obtained from a,, 
by deleting both the ith row and ith column for alli € {i: |w;| = 0}. 
Theorem 3: Let A € M. We havea € U(a € Up) if and only if T (ay) 
€ P(ay € Po) for each w € L" with |w| # 0. 

Proof of Theorem 3: We shall use two lemmas. With regard to the first 
of the lemmas, M,, denotes the normed linear space of real n Xn matrices, 
with the usual Euclidean norm, and C denotes {u € M,: there isaw € 
L” with |w| = 1 such that Ujj = (w;,w;) for alli and 7}. 

Lemma 2: C is compact. 

Proof of Lemma 2: The set C is obviously bounded. To show that C is 
closed, let u™,u), . .. be a sequence of elements of C that converges to 
some element u of M,. 

Given a real n-vector v, for each w € L” and its corresponding element 
u of C, we have v'"uv = (2jv;w;, Djv;w;) = 0.1 Thus, each u) is nonne- 
gative definite, and therefore it follows that @ is nonnegative definite. 
In view of the fact that Tr(u%)) = 1 for all 7, it also follows that Tr(a) = 
1. 

Since @ is nonnegative definite and has unit trace, there is an or- 
thogonal matrix T and a diagonal matrix D = diag (d1,d9,...,d,) with 
d; 2 Oand 2d; = 1 such that 


“a= TDT'= ~ da, T)(T))*, 


in which T; is the /th column of T. Referring to the pairwise mutually 
orthogonal elements €1,€9, .. . ,e, mentioned in Section 2.1, let z in L” 
be defined by 


em Dod, Tie. 
i 
Using the orthonormality of the e;, and the fact that for each / the sum 
of the squares of the components of T; is unity, it is not difficult to verify 
that |z| = 1, and that (z;,z;) is equal to the i,jth element of d for alli and 


* A proof is given in Ref. 10. Another proof can be obtained from the fact that, since a & 
Po, there is (see Ref. 9) a real nonzero n-vector g such that q;(aq); < 0 for every i such that 
qi 0. 

t The superscript “tr” denotes transpose. 
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j. This shows that C is closed, and completes the proof of the 
lemma.* 

The following lemma is proved in Ref. 9. 

Lemma 3: A real square matrix m belongs to P(P9) if and only if vz, (mv), 
> 0 (vp, + Oand vz (mv), = 0) for some k for each real nonzero vector 
v of dimension equal to the order of m. 

In order to prove the theorem, suppose initially that I'(a,,) € P for 
every w © L” with |w| # 0. By Lemma 3, for each w € L” with |w| = 
1 we have (T'(a,,)v), > 0 for some index k, when all of the components 
of the vector v of compatible dimension are unity. Thus, max; 2}_; 
a;;(w;,w;), which we view as a function of the matrix u whose elements 
are the (w;,w;), is positive for each w in L” with unit norm. Since max; 
271 a;j(W;,W;) is obviously a continuous function of u, and, by Lemma 
2, C is compact, there is a o > 0 such that 

min max 3 ajj(w;,W;) = o. (17) 
. uEC i j=l 
Therefore, for each w € L” with |w| = 1 there is an index k such that 


n 
pe Apj (Wp,W;) = o|w|2 = o|w,| 2, 
j=1 


from which we see that for each w € L” with |w| # 0, there isak such 
that |w,| # 0 and 


n 
DV apj(wp,wj) = o|wp|?. 
j=1 


Thus, a € U. 

To show that a € Up when a, € Po (and hence I'(a,,) € Po) for each 
w € L" with |w| > 0, we observe that then, by Lemma 3, for each w € 
L” with |w| ¥ 0 we have (I'(a,,)v), = 0 for some k when the components 
of v are all unity. Therefore, for each w € L” with |w| + 0, there isak 
such that |w;| ~ 0 and 


X Apj(Wp,W;) = O, 
j= 
which means that a € U5. 

Suppose now that for some w € L” with |w| > 0 we have I'(a,,) € 
P(ay € Po). Then, by Lemma 3, there is a nonzero vector v such that 
Up (T(ay)v)z S 0 (u,(T(ay)v), <0) for every k such that v;, ~ 0. Thus, 
by multiplying each w; for which |w;| + 0 by the appropriate component 
of uv, it is a simple matter to construct az © L” for which |z| 4 0 and 
2 jij (z;,2;) 0 for all i(2jaj;(zi,2;) <0 for alli such that |z;| # 0). This 
completes the proof of the theorem. 

Corollary 2: If n = 3, U(Uo) is a proper subset of the matrices of order 
nin P(Po). 


* Of some peripheral interest is the fact that it is not necessary to assume that L is 
complete. 
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Proof: To see that U(Up) is asubset of P(Po), leta € U(Up), let e be any 
element of L such that |e] = 1, and let w be the element of L” defined 
by w; = e for alli. Thus a,, = a, and, by Theorem 3, a € P(P9). 

In order to show that for n = 3 there is a matrix of order n in P(P) that 
is not contained in U(Up), observe that it is sufficient to consider the 
n = 8 case, and let a‘+) and a) be defined by 


11 1 —10 
aH=11 11 1 | 

1 1 1.1 

1 1 -10 
a®M=]1 1 1}. 

1 1 1 


We have a‘*) € P and a®) € Po. Let w € L3 be given by 


1 0 
w= 1 ey t+ 1 €2, 
1 10 


in which e, and eg are orthogonal elements of L with unit norm. It is a 
simple matter to verify that 


11 1 -10 
a=) 4d 22 11 |, 
1. 4114 
1 1 -10 
aO=l|1- 2 11 
1 11 101 


and that we have det[a‘*] < 0 and det[a] <0. By Theorem 38, this shows 
that a‘*)(a)) € U(Up), which completes the proof. 
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Corollary 3: If n = 2, U(Up) = P(Po) restricted to 2X2 matrices. 
This follows directly from Theorem 3.* 


4.1 Definitions 


Let N denote the set of real symmetric nonnegative definite matrices 
m of order n with m;; > 0 for some. For each m € N, let a,, denote the 
nXn matrix whose 1,jth element is a;;7;; for alli and j,* and let A(a,,) 
denote the matrix obtained from a,, by deleting row i and column i for 
eachi € {j: mj; = 0}. 

Corollary 4: We have a € U(U>) if and only if A(a,,) € P(am € Po) for 
each m € N. 

Proof: The proof of Lemma 2! shows that a real matrix m of order n 
belongs to N if and only if there is a w in L” such that |w| ~ 0 and m,; 
= (w;,w;) for alli andj. Thus, Corollary 4 follows from Theorem 3. 


4.2 Introduction to Corollary 5 


In order to present our next corollary, we need the following additional 
definitions: Let S(m) denote the set of all matrices obtainable from a 
given real n Xn matrix m by replacing each off-diagonal element mj; of 
m with r;;m,;, where the r;; are real numbers that satisfy rj; = rj; and |r;j| 
< 1. Let R denote {m € P: S(m) C P}, and, similarly, let Ro = {m € Po: 
S(m) C Po}. 

When n = 2 and P and Po are restricted to 2X2 matrices, we have R 
= P and Ro = Po. On the other hand, if we let a{+)(A) and a)(X), re- 
spectively, denote the matrices obtained from a‘*) and a of the proof 
of Corollary 2 by multiplying the (1,3) and (8,1) elements by a scalar 
variable \, then a‘+)(1) © P and a(1) € Po, but a‘+)(0) & P, and, 
similarly, a (0) € Po. This shows that R(Ro) is a proper subset of the 
nXn matrices in P(Po) when n = 3.5 Two familiar classes of matrices 
contained, for example, in Ro are the set of row-sum dominant matrices 
and the set of column-sum dominant matrices. 

Corollary 5: We have a € U(U)) if either 

(i) aC R(Ro). 

(it) There are diagonal matrices d; and dz of order n with positive di- 
agonal elements such that d,adz ts positive definite (nonnegative 
definite). 

Proof: Suppose first that a € R(Ro). Let w be any element of L” such 
that |w| # 0, let c = diag(cj,co,... ,¢n) in which for alli, c; = Oif |u;| 


* The proof of Corollary 2 shows that U C P and Uo C Po forn = 2. 

+ In other words, let a,, denote the “Schur product” of a and m. 

t Lemma 2 is used i in the proof of Theorem 38. 

8 It will become clear that this proposition also follows from Corollary 2 and Corol- 


lar 
VAS usual, we say that a real square matrix m is positive definite (nonnegative definite) 


if and only if the symmetric part of m is the matrix of a positive definite (nonnegative 
definite) quadratic form. 
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= Oand ¢c; = |w;|~! if |w;| > 0, and let ['(ca,,c) denote the matrix ob- 
tained from (ca,,c) by deleting the rows and columns corresponding to 
the indices i for which c; = 0. In view of the fact that |(w;,w;)| < |w;|-|w,| 
for each and J, we see that I'(ca,,c) € P(Po) and hence* that I'(a,,) € 
P(Po). By Theorem 3, a © U(U5). 

At this point, we need the following lemma. 
Lemma 4%: If p = \p;;} and q = {q;;} are real square matrices of the same 
order, with p positive definite and q symmetric, nonnegative definite, 
and such that q; > 0 for alli, then r = {p;jq;j;} 1s positive definite. 
Proof of Lemma 4: Let p and q be as indicated, and let k denote the order 
of p. The proof of Lemma 2 shows that Le contains k functions 
fifo, ..- ofp such that 


ij = f, * ft)f;(t)dt for alli and j. 


With v any real nonzero k-vector and with \ the smallest eigenvalue of 
the symmetric part of p, we have 


vrv = 2 U{D;Pij f. f(t) fj(t)dt 
oy) : dX pivifi(t )vjfj (C)dt 
uJ 


> f, AE (oifi(t))Pat 
> 0, 


which shows that r is positive definite. 

To complete the proof of the corollary, suppose that d,ad¢ is positive 
definite, with d; and dy» as described, and let m € N. By Lemma 4, 
A{d amd} (i.e., A{a,,} with a replaced with d,ad_) is positive definite and 
hence it belongs to P. Therefore, Afa,,} © P, and, by Corollary 4, 
ac U. 

The proof for the case in which d,adz is nonnegative definite is es- 
sentially the same, and is omitted. 


4.3 Comments Regarding Corollary 5 


In light of the fact that Ro = Po restricted to 2X2 matrices when n = 
2, the following special result is a direct consequence of Corollaries 5 and 
1, and the content of the proof of Proposition 1. 
Proposition 2: Letn = 2and A € M. Let H.2 and H.3 (as well as A.1 and 


* Here and in another part of the proof, we use the easily proved result that a real square 
matrix m belongs to P(Pp) if and only if dymd2z € P(Pp) for every pair of compatible di- 
agonal matrices d, and d2 with positive diagonal elements. 

- 2 proof that the conclusion of Lemma 4 holds when gq is positive definite is given in 
ef. 11. 
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A.3) be satisfied. Then (4) is L-stable for every B if and only if a € 
Po. 

An example of a matrix a that is nonnegative definite and such that 
a € Rois given by 


1 1 1 
a=I/1 91/2 91/2 . 
1 21/2 91/2 


since 


1 1 0 
det}1 2!/2 921/21 <0. 
Q 21/2 91/2 


Similarly, a very simple example of an a € Ro such that d,adz is non- 
negative definite for no suitable d; and dg is 


lo ol 
a= : 
0 0 

Of some interest is the fact that a € U ifa is an M-matrix (i.e., if a 
has positive principal minors and nonpositive off-diagonal elements); 
in that case there is a diagonal matrix d with positive diagonal elements 
such that ad is strongly row-sum dominant* and therefore ad and con- 
sequently a belong to R. 

Theorem 4: Let A € M. If a;; = 0 for alli, thena € Upif and only if a 
E Ro. 
Proof: The “if part” is a special case of Corollary 5. 

Suppose that a € Up with a;; = 0 for alli, and suppose also that a € 
Ro in which case there is an element b of S(a) such that b € Pp. Let b 
be given by bj; = 0 for alli, and bj; = rjjajj with rj; = rj; fori + J. Choose 
n real numbers 11,129, . . - ’nn SO that the nXn matrix m given by mij; 
= rij for alli andj is nonnegative definite. Observe that m € N. Since 
Am = 6, by Corollary 4, we have a contradiction to the supposition that 
a € Up». Therefore, a € Ro when a € Uo and a;; = 0 for all 1, which 
completes the proof of the theorem. 


4.4 Comment regarding Theorem 4 


We can have a € Po with a;; = 0 for alli, anda € Ro. For example, 
let 


0 -1 1 
a=|1 0 
0 1 0 


* See the theorem given on page 387 of Ref. 12. 
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Fig. 1—Flow graph of an interconnected system. 


Then a € Po, and 


0 Te T13 
det T12 0 0 = ro3l19" 13 <0 
0 T93 0 


for, say, re3 = 1g = 113 = 1. 


4.5 A Specific example of an L-stable interconnected system 


Assume that H.2 and H.3 (as well as A.1 and A.3) are satisfied. 
For the system described in flow-graph form in Fig. 1, we have 


Yr = %1 
Yo = Xq— Byx, + Bgx3 — Baxg 
V3 = X3 — 2Boxo + 2Byx4 
ya = X4 + Boxe — Bgx3. 
Here A € M, with 


a= 
0 -2 0 2 
0 i. 
To see that a € Ro, consider the matrix 
0 0 0 0 
Tq, 0 ro3. Tag 
0 2s, 0 rsa] 
0 roa 34—ClC«OO 
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We observe that its determinant vanishes and every 1X1 and 2X2 
principal minor is nonnegative for all real values of the r;;. It is a simple 
matter to verify that its principal minor of order three obtained by de- 
leting the first row and first column vanishes for all values of the r;;, and 
it is clear that every other principal minor of order three also vanishes 
for all values of the rj;. 

Since a € Ro, by Corollary 1 and either Corollary 5 or Theorem 4, the 
system described in Fig. 1 is L-stable. 

Another way to prove that the system in Fig. 1 is L-stable is as follows. 
Since H.2 holds, |Byu| < co|u| foru € L. It therefore suffices to show 
the L-stability of the system obtained from the flow graph in Fig. 1 by 
deleting B,, x,, and y;. That can be done with the aid of Corollary 5 by 
verifying that the interconnection matrix a of the modified system has 
the property that there is a 3X3 diagonal matrix d with positive diagonal 
elements such that da is nonnegative definite. 
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A Note Concerning Optical-Waveguide 
Modulation Transfer Functions 
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(Manuscript received March 9, 1978) 


A necessary and sufficient condition is given for the modulation- 
transfer-function of certain multimode optical fiber guides to be zero 
free in the closed right-half of the complex plane, and to be structurally 
stable with respect to that property. The condition is of interest, for 
example, in connection with the possibility of determining the phase 
of a modulation-transfer-function from its amplitude. 


I. INTRODUCTION AND PRELIMINARIES 


Reference 1 considers the range of validity of a Hilbert-transform 
approach in which the measured magnitude of the modulation-trans- 
fer-function of an optical fiber guide is used to compute the guide’s im- 
pulse response.* It is argued there that a key “minimum-phase as- 
sumption” can fail to be satisfied in important cases, and a few closely 
related experimental and analytical results are presented. 

The purpose of this note is to report on a result along the same lines 
as a proposition given in Ref. 1 to the effect that, for a fiber guide that 
can propagate a finite number of discrete modes without mode mixing, 
the modulation-transfer-function (more precisely, the Laplace transform 
version of the modulation-transfer-function) is zero-free in the closed 
right half of the complex plane, and that property is structurally stable 
in a certain sense, if and only if a certain condition is met. The theorem 
described in Section II is concerned with a more realistic and far more 
interesting case in which mode mixing is not ruled out. In particular, the 
result provides further detailed support for the conclusion reached in 


* By “the guide’s impulse response” is meant the output power of the guide excited by 
a unit impulse of optical power. The modulation-transfer-function G(w) is the envelope 
response of the fiber guide to an incoherent optical signal sinusoidally modulated at angular 
frequency w. To the extent that certain approximations hold, (Ref. 2), the impulse response 
is the Fourier transform of the modulation-transfer-function. The reason for considering 
the Hilbert transform approach is that it is often desirable to determine the impulse re- 
sponse of fiber guides by methods other than direct time-domain measurement, and, while 
|G(w)| can be measured easily, it is at the present time difficult to accurately measure the 
phase of G(w). (See Refs. 3 and 4.) 
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Ref. 1. (i.e., that “nonminimum-phase behavior” is likely to arise, and 
can arise, in important actual cases). As in Ref. 1, it is the introduction 
and use of the concept of structural stability of a mathematical property 
which, for the case considered, enables a result to be obtained that is 
complete and easy to interpret.* 

We consider, as in Ref. 1, a class of optical fiber guides for which ap- 
propriate approximations can be made so that the modulation-trans- 
fer-function G(w) of a guide can be written in the form 


Ge) = f, e-ietda(z), (1) 


in which T denotes a closed, finite, real interval whose end points depend 
on the refractive indices of the core and cladding, and a(r) is a real-valued 
nondecreasing function.‘ It is assumed throughout that a(7) satisfies 
the normalization condition 


f, dae) = 


As mentioned previously, in Ref. 1 attention is focused on the class of 
fiber guides that can propagate n discrete modes without mode mixing.! 
In that case, G(w) can be written as 


 djemier, (2) 
J=1 


in which each d; is a positive number that represents the initial excitation 
of the jth mode, and 7; < 72 <... < Tp. 

In this note we suppose that the right side of (1) can be expressed 
as 


k 
5 djeiwr; + Si e-ieth(z)dz, (3) 
j=l T 


in which: kr is a positive integer (the motivation for using the subscript 
F will become clear shortly), d; > O for] <j Skp, 71 <12<...< Trp, 
and b(7) is a bounded piecewise-continuous$ nonnegative function which 
takes into account mode mixing. It is also assumed that 7, has the fol- 
lowing property: b(r) = 0 for 7 e T with 7 < 71 (which, of course, allows 
the possibility, but does not require, that 7; is equal to the lower endpoint 


* The general problem of determining when the Hilbert-transform approach (i.e., when 
the so-called Kramers-Kronig transformation) is valid is of interest in many fields (see, 
for example, Ref. 5). 

1 Thus, roughly speaking, da(z) in (1) can be replaced with f(1)d7 in which the function 
f(r) is nonnegative and may contain impulses corresponding to discrete modes. See Refs. 
2 and 3 for the relevant background material. We are assuming that material dispersion 
can be neglected. 

Typically, n > 100. 

* The piecewise-continuity assumption appears to be adequate for applications. For 
basically a somewhat more general version of the theorem stated in Section II, see Section 
Ded, 
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of T). Since (8) is a specialization of the right side of (1), we also have 
7; ¢ T for each j, as well as 


kr 
Sd; + ff, b(r)dr = 1. 
j=l T 


In physical terms, b is the relative-power density function associated 
with the nondiscrete modes, and the idealized impulse response of the 
guide is the inverse Fourier transform of (3). The observable impulse 
response of the guide (i.e., the impulse response of the guide-detector 
combination) is a somewhat smoothed version of the idealized response, 
with the smoothing provided by the detector (see Ref. 3). 

The important assumption that d; > 0 and that b(7) = 0 for 7 « T with 
7 <7, means that a discrete mode corresponding to the smallest modal 
delay is propagated. This assumption appears to be reasonable for at 
least some interesting classes of guides. For example, if a guide with a 
step-index profile is short enough to neglect mode conversion phe- 
nomena, then it is not unreasonable to assume that G(w) has the form 
given in (2) with 7; the modal delay corresponding to the fundamental 
mode. In a real fiber, geometrical perturbations couple energy among 
the modes so that the distribution of modal delays changes continuously 
from a discrete set to a continuum as the fiber length L increases. Ex- 
perimental evidence indicates that the assumption is reasonable at least 
if the guide is not too long.* (For a particular fiber, there is a charac- 
teristic coupling length L, such that for L > L,, it is difficult in the time 
domain to isolate discrete modes with appreciable energy.) 


il. THE RESULT 
In this section, z denotes a complex variable and F(z) is defined by 


kr ; 
FG) = 3 diet + J, e-2tb(r)dr (4) 


for each z. (Of course, if G(w) denotes (3), then G(w) = F(iw), and F(z) 
is simply the Laplace transform of the generalized function whose 
Fourier transform is G(w).) 

In order to state our result, consider an arbitrary function H(z) of the 
same type as F(z). More explicitly, let H(z) be given for all z by 


H = gH —2t; =2T d 
@)= 2X dja 7 Sie B(r)dr, (5) 


in which ky is a positive integer (not necessarily equal to kr), and the 
6;, the t;, and 8(r) satisfy the restrictions imposed on the corresponding 
terms in (4). Let S denote the set of all such functions H(z). 


* The writer is indebted to his colleague I. P. Kaminow for a helpful discussion con- 
cerning the significance of the assumption described above. 
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For the purpose of defining a “distance” between F and an arbitrary 
element H of S, let Jr, Jy, and J, respectively, denote the sets of num- 
bers {1,2,... ,kr}, {1,2,...,Ry}, and {1,2,...,min(kp,ky)}, and, with y a 
real variable, let q(y) denote any continuous nondecreasing function of 
y such that q(0) = 0. 

Let the “distance” p(F,H) between F and any H in S be defined 
by* 


AF,A)= > |dgj-Slt+ YD d+ FY y 
fed he pea. Sega) 





f ” [b(r) — Bz]. 6) 


Each term on the right side of (6) has a direct interpretation. In par- 
ticular, 


+q (max I7;- tl) + max 
jed 


uve T 


dj + a dj; 
je (Jr-J) je (JH-J) 
in which at most one sum is nonzero, reflects the extent to which terms 
in one of the two finite sums in (4) and (5) do not have counterparts in 
the other. Also, 


f ” [b(r) — Br) dz 


is an integral of the difference of two power-density functions, and, 
roughly speaking, if 


‘max 
u,v e T 





§ ° [b() — B(x) dr 


is sufficiently small, then, for practical purposes, the functions b and 8 
are indistinguishable in the sense that the observable impulse response 
of the guide is essentially unchanged if 6 is replaced with 8. (The portion 
of the idealized impulse response that does not contain impulses is g 
defined by g(t) = b(t) fort e T and g(t) = Ofort ¢ T. If, for example, the 
smoothing introduced by the detector is modeled by a filter with impulse 
response r given by r(t) = p—! for ¢ « [0, p] and r(t) = 0 otherwise, in 
which p is asmall positive constant, then the observable version of g(t) 
isp! fi-,g(7)dr for each t. Similarly, if instead r(t) = 0 for t < 0, r(0) 
is finite, and the derivative of r is absolutely integrable on [0,~), then 
an integration by parts shows that the observable version of g is essen- 
tially unchanged when b is replaced with a sufficiently nearby 6 in the 
sense indicated above.) 

Our result, the theorem given below, provides an answer to the fol- 
lowing question: When is it true that F(z) ~ 0 for Re(z) = 0 and that 


* We adopt the convention that a sum over the empty set is zero. 
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property of F is structurally stable in the sense that there is a positive 
constant ¢ (which can be thought of as a “‘tolerance”’) such that H(z) # 
0 for Re(z) = 0 for every H in S such that p(F,\H) < «. 

Theorem: We have F(z) # 0 for Re(z) = 0 with that property of F 
structurally stable, if and only if 


a> dj+ f b(a)dr. 
j T 


edF 
J#1 


Note that the theorem* does not rule out the possibility that the 
condition given in the theorem is violated and F(z) is zero-free in the 
closed right-half plane. (In fact, an example given in Ref. 1 shows that 
the possibility can occur. Essentially the same example can be used to 
show also that if the condition is violated, then it need not be the case 
that all functions in S “‘sufficiently close” to but different from F have 
a zero in Re(z) = 0.) On the other hand, at present it appears that there 
are complex, and for practical purposes impossible-to-specify, additional 
relationships among the 7;, the dj, and b that, in particular take into 
account geometrical perturbations along the length of a real guide. The 
theorem shows that, when additional information is unavailable, it is 
not possible to prove that F(z) is zero-free in the closed right-half plane 
whenever 


Bes dj+ f° b(r)dr 
jedr T 


J¥1 


(which, in view of the normalization condition concerning a, is equivalent 
to the statement that the discrete mode with the smallest delay has 
relative power less than 44) and, in the sense indicated above, the 7;, the 
d;, and b are known only to within some tolerance, no matter how small 
the tolerance is. 

A proof of the theorem is given in the next section. 


* As indicated earlier, one application of the theorem is that it provides further detailed 
support for the material reported on in Ref. 1. For the benefit of the reader who has not 
read Ref. 1, we mention that a much simplified version of essentially the proof given in 
Section 2.1 can be used to show that if kr = 2, if b(7) and G(7) in (4) and (5), respectively, 
oe each replaced by the zero function, if S is ‘further restricted so that ky = kp and 6; = 

for j = 1,2,...,krforallH eS, and if p(F,H) is instead q(max; . s|7; — t;|) [i-e- _ just 
the fourth term in the sum on the right side of (6)], then: F(z) ~ 0 for Re(z) = 0 and there 
is an « > O such that H(z) = 0 for Re(z) = 0 for every H in the corresponding S with p(F,H) 
< «¢, if and only if dj >)? £, d;. This result is basically a slight generalization of the com- 
parable proposition in Ref. i. 


OPTICAL WAVEGUIDE TRANSFER FUNCTIONS 3051 


2.1 Proof of the theorem 


For the reader’s convenience, we first repeat some of the material 
described above. We have T = [T1,T2] in which T; and T> are real 
numbers such that T, < Te, and S denotes the set of all functions of the 
complex variable z of the form (5) where ky is a positive integer, 5; > 
Oforl <j Sky, T1 Sti <to<...< tp, < To, B(7) is a nonnegative 
bounded piecewise-continuous* function defined on T' such that B(7) 
= 0 for ze T with tr < 71, and 


ku 
Lat f, B(r)dr = 1. 


The “distance” p(F,H) between any H «¢ S and the particular element 
F of S given by (4), is defined by (6). 
Proof of the “If” Part: Suppose that 


di> dX djt+ b(r)dr. 
jedF T 
j#l 
With 
P= =>) dp = ff, b(z)dr, 
jedr £ 
J#1 
let ¢ satisfy 0 < « < (1/4)r. For each H ¢ S with p(F,A) < «, we see that 
|d, = by| < €, 


> \d; —5;| S«, > dx. > gars 
jed je (Jr—J) je (Jy—-d) 
j¥l 

as well as 


<6 


b(r)dr — B(r)dr 
T T 


OL Op B(7)dr. 
jedH i 
j#1 


Thus, for Re(z) = 0 and H < S with p(F,H) < «, we have 





and therefore 


Jez=4H(z)|=|d1+ Yo dye zit) + Sf, et B(7)dr 
jedH i 
j#l 
* Tt will become evident that the theorem holds also if “piecewise continuous” is replaced 


with either “Riemann integrable” or “Lebesgue integrable.” In this connection, see Sec- 
tion 2.2. 
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>d—| LD serait + Si e- 2-8 (r) dr 
jedH EF 
j#¥l 
a ee ff, B(r)dr. (7) 
jedH T 
j#1 


Since the right side of (7) is positive, it is clear that H(z) # 0, which 
completes the proof of the “‘if’’ part. 
Proof of the “Only If” Part: Suppose now that 


a< dj+ f° d(a)dr, 
jedr i 
j#1 


and let « > 0 be given. 

Let kg = max(kp,2), let 7 = min ((1/6) e, (1/2) d;), and let 6; = d, — 
n. lf kp > 1, let dg = dg+nand 6; =d; forj e {zy eJr;j #1,2}, andifkp 
= 1, let 6g = n and 72 = T2. Then the function G defined (for all z) by 


KG 
G(z) = ¥ bje727 + ff, e-7b(r)dr 
j=l T 


belongs to S, and we have (if we set H = G): 


1 
ae ae Sa ee el (8) 
jed je Ur-J) je Ja-d) 3 
and the strict inequality 
aps i+ bladder. (9) 
jeFe T 
J#1 


Let 6, denote min{(7j+1 — 7;): 1 Sj,j +1 < kg}, and let A= sup; . 7 
b(r). With B = {7 e T:b(r) > 0}, let s; and se denote inf B and sup B, re- 
spectively, when B has nonzero measure, and TJ; and T»9, respectively, 
otherwise. 

Choose any 6, > 0 such that q(6,) < (4s) «, and let 6 be any positive 
number such that 


it, oe 1 
5 ean G bey 7 Bos & (IBA), + (2 s1)). (10) 


Let w = 76—!, and let K; denote the set of numbers of the form 7; + k6, 
with k an odd positive integer. Clearly, exp[—iw(t — 71)] = —1 fort e K5. 
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Choose t1 = 71, and (using kg = 2 and 26 < (4) 6,) for each] = 2,3,...,kg 
choose a t; in Ks ( [71,T 2] such that | 7; — t;| < 26. Since 26 < (44) 6, and 
25 < 6,, we have t) <to<...<trg and q(max;,7|7; — tj|) < CA)e. 

We see that the “distance” between F and the element EF of S given 
by 


k 
E(z) = ¥ dje724i + .e777e (a7 
j=l T 


is at most (7) e. It therefore suffices to show that there is an H in S de- 
fined by 


k 
H(z) = 3 OF ad i fi e-27B(r)dr 
j=l a 
with 

max 


max "= Bode 


such that H(z) = 0 for some z with Re(z) > 0. 

Let L = Ks () (81,82). Since 6 < 14 (so — s1),* L contains at least two 
points. Let the points in L be pj,po,...,Dn, ordered so that p ; < po < 
... <Pppn. Let o be a positive number such that o < 6, p} —s1 >, and se 
— Pn > o. With I(u,v) denoting f% b(7)dz for u,v e [s1,s9], let 8,(7) be 
defined for 7 « T by B,(r) = f(t — pi)I(s1,p1 + 5) + f(t — po)I(p2 — 4, po 
+ 6)+...4+ f(t — pr)I(pn — 6,82), in which f(t) = (20)7! for |t| < o and 
f(t) = 0 otherwise. Since 


ff, B.(r)dr = I(s1,p1 + 6) + I(po— po + 6) +... 





1 
<—e 
3 





+(p n— 5,89) = Si, b(r)dr, (11) 
we see that the function H, defined by 
k 
H(z) = ¥ 6je~24i + ff, Bo(r)e~27dr 
j=1 T 


belongs to S. 
Using I(s1,p1 + 5) S 36A,! I(p; — 6,p; + 6) S 26A forj = 2,...,(n- 
1), I(Dn = 6,89) Ss 360A, 


J, 8dr = f. Belaae 


for t = s1,p1 + 6,po + 6,... ,Dn—1 + 6,89, and the fact that b(7) and 8,(7) 
are nonnegative, we have 


t t 
| [badd — f polndar 
T1 Ti 
t See (10). 


! Notice that p; — s1 S 26. 


< 30A 
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for all t e T. It follows at once that for u,v e T, 


| ff Th Sale| ZS 6A 
< . et 
3 


Therefore p(F,H,) < ¢, uniformly for o as described. 
Let P(z) be defined for all z by 


kg 
P(z)= 3 6je 74) + e~7P1](s1,p1 + 4) 
jst 


+ e—*P2] (po _ 0,P2 + 6) +...+ e—2PrI (py ~ 6,S9). 


Let a be a real variable. Since t; e Ks; for j = 2,3,...,kg, and p; « K5 for 
j = 1,2,...,n, and (9) and (11) hold, P(a + iw) exp|(a + iw)t,]! is real 
and negative when a = 0. On the other hand, P(a + iw) exp[(a + iw)t1] 
is positive for all sufficiently large a. Thus, P(z 1) = 0 for some z; with 
Re(z1) > 0. 

The function P(z) is analytic in z throughout the complex plane. Since 
it is not identically zero and is analytic at z = 21, its zero at z = 21 is iso- 
lated. Therefore, there exists a constant r > 0 such that r < Re(z,) 
and, with [ = {z:|z — z;| = r}, P(z) # 0 for z eI. It follows that min 
{| P(z)|:z eT} is positive. 

Using the fact that 


ttoa 
(2c)-! f, e-2tdr = e~*tw(oz), 
t-o 
in which w(oz) = (20z)~1(e2" — e-27), we have 

H,(z) = P(z) + [W(oz) — 1]y(z), (12) 
where 
y(z) = e~ P11 (s1,p1 + 6) + e~7P2I (po — 6,p2 + 4) 

+...+e72PnI(p, — 6,89). 


The function |y(z) |is bounded on I, and max {|w(oz) — 1|:z e T} (and 
hence max {|[w(oz) — 1]y(z)|:z e T}) can be made arbitrarily small by 
choosing o to be sufficiently small. By Rouché’s theorem, for sufficiently 
small o, H,(z) has a zero inside I (and hence in Re(z) > 0). This com- 
pletes the proof. 


t See (10). 
t Recall that w = 7671. 
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2.2 Comment 


It is apparent that we have proved a somewhat stronger result than 
the theorem stated. Suppose that the definition of S is changed to the 
extent that G(r) need not be piecewise continuous, but merely Lebesgue 
integrable. Then it it is clear that the “if” part of the theorem remains 
true. More importantly, the proof shows that if 


a< > dj+ {' b(r)dr, 
j T 


edp 
J#1 


then, for any « > 0, there is an H in S of the form (5) with the following 
properties: p(F,H) < «, H(z) has a zero in the open right-half of the 
complex plane, ¢) = 71,ky = kp,6; = d; forj = 1,2,... ,kr, B(7) is piecewise 
constant, and the smallest closed real interval containing the support 
of b(7) (which might possibly be the “empty interval”) also contains the 
support of 6(r). 
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Some Extensions of the Ordering Techniques for 
Compression of Two-Level Facsimile Pictures 
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We present extensions of our earlier published ordering techniques 
for efficient coding of two-level (black and white) facsimile pictures. 
Ordering techniques use the two-dimensional correlation present in 
spatially close picture elements to change the relative order of trans- 
mission of elements in a scan line so as to increase the average length 
of the runs of consecutive black or white elements in the ordered line, 
making the data more amenable to one-dimensional run-length coding. 
The extensions that we consider allow us to use different run-length 
codes to match the statistics of different parts of the ordered data, and 
to drop certain runs from transmission. Computer simulations using 
the eight standard CCITT pictures, which have a resolution of ap- 
proximately 200 dots/inch, indicate that these extensions can result 
in transmission bit rates which are about 11 to 21 percent lower than 
the ordering schemes described in our earlier work. The entropies vary 
between 0.021 and 0.125 bits/pel for the eight pictures. 


1. INTRODUCTION 


Coding of two-tone (black and white) facsimile pictures has gained 
considerable importance in the past few years, as is evidenced by a large 
number of papers as well as by a variety of facsimile communication 
systems. More and more sophisticated coding algorithms are being used 
which depend upon the two-dimensional spatial correlation present in 
picture data. This trend is understandable when one realizes that the 
cost of digital circuits and memories is decreasing faster than the cost 
of transmission. 

This paper presents some extensions of our ordering schemes?” for 
efficient coding of facsimile pictures. In the basic ordering scheme we 
make a prediction of the present element using the surrounding previ- 
ously transmitted picture elements and classify it as ‘good”’ or “bad,” 
depending upon the probability of the prediction being in error, condi- 
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tioned on the specific values of the surrounding elements. We then 
change the relative order of the prediction errors corresponding to pic- 
ture elements along a scan line using the “goodness” of the prediction 
in such a way as to increase the average run-length of the black and/or 
white elements and then transmit the run-lengths. 

This paper has several objectives. First, we give the entropy results 
using our earlier ordering schemes on the CCITT (International Tele- 
graph and Telephone Consultative Committee) images. This will allow 
a comparison with the many coding algorithms proposed by other 
workers since the CCITT images are widely available. This was not pos- 
sible from the results presented in our earlier paper where we had used 
locally generated picture material. The second objective is to present 
certain extensions of the ordering schemes and give results of computer 
simulations. The following extensions are presented: (1) Since good and 
bad regions of the ordered prediction errors have different statistics, two 
sets of run-length codes can be used. It is not necessary to specify the 
location of the boundary between the good and bad regions to the re- 
ceiver. (ii) Runs across the good-bad region boundary can be bridged 
wherever advantageous, even if the color of the element changes across 
the boundary. (vii) A specified run in each line of data can be omitted 
from transmission since the number of elements in a line is fixed. The 
length of the omitted run can be derived at the receiver if a line sync code 
is transmitted at the end of each line. 

Computer simulations indicate that entropies ranging between 0.021 
and 0.125 bits/pel for the eight CCITT pictures are possible using these 
extensions. This represents a 11- to 21-percent decrease over the ordering 
techniques of our earlier paper.! 


ll. CODING ALGORITHMS 


In this section, we describe our coding algorithms in detail and present 
results of the computer simulations. The pictures used for simulations 
are the eight CCITT pictures which have a resolution of approximately 
200 dots/inch. Each picture consists of 2128 lines with 1728 picture el- 
ements (pels) in each line. Copies of these pictures are shown in Figs. la 
through 1h. As a measure of performance, we used the sample first-order 
entropy of run-length statistics. We computed the average black and 
white run-lengths and the entropy of black and white runs using, for 
example, the formula 

Ey =~ D5 logs (1) 
where E,, is the entropy of the white run-lengths, n; is the number of 
white runs of length i, and N is the total number of white runs. Using 
these and eq. (2), we computed the entropy, £, in bits/pel by: 
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(c) (d) 


Fig. 1—The eight (a to h) CCITT pictures used for computer simulation. Each picture 
consists of 2128 lines with 1728 pictures elements in each line and has an approximate 
resolution of 200 dots/inch. (Figs. le through 1h on next page) 


_ Ey: Nw t+ E+ No (2) 
rN, w + ryN b : 

where E, is the entropy of the black run statistics, r,,, r, are the average 
white and black run-lengths, respectively, N,,, Nz, are the number of 
white and black runs, respectively, and E is the entropy in bits/pel. The 
above numbers are computed for the entire picture (1728 X 2128 pels) 
using, on the sides and top of the picture, a border of white elements 
surrounding the actual picture. 
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(g) 


Fig. 1 (Continued from previous page). 


The first step in the ordering algorithm consists of making a prediction 
of the present picture element using the already transmitted surrounding 


picture elements. We define a state S; using the four surrounding picture 


1 Prediction algorithm 


2. 


4 as shown in Fig. 2. There are 16 states. The pre- 
in a standard way? as the one which minimizes the 


probability of making an error, given that a particular state has occurred. 


Thus the predictor C(S;), 


, 


mas 
7 & 
a> & 
tm D 
<> 
aw 
nO 
=a 

Sant 
eS 
3 6 
— 
os 


is given by 


Uy 


for a given state S; 
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x x x PREVIOUS LINE 
X, Xo 
i PRESENT LINE 
PICTURE - ~ 


ELEMENT TO BE PREDICTED 


Fig. 2—Configuration for state definition. 


C(S;) = “black,” if P(X = “black” |S = S;) > 0.5 
= “white,” otherwise, 


where P(-|-) is the conditional probability measured for the picture. For 
convenience, we represent the color of the picture elements by “1” and 
“0,” “1” for black and “0” for white. The predictor varies from picture 
to picture; however, the variation is not great, as shown in our earlier 
paper.! The predictor for a typical picture [CCITT picture 2 (Fig. 1b)] 
is shown in Table I. 


2.2 Ordering algorithms with one set of run-length codes 


In this section, we give the simulation results using our earlier ordering 
algorithms. First, in Table II, for the purposes of comparison, we give the 
entropies of the run-length statistics from the raw picture data as well 
as from the prediction error data. As expected, the entropies of the 
run-lengths of the prediction errors show about 0.7 to 24 percent decrease 
over the entropies of the run-lengths of raw data. The decrease is smaller 
for the busier pictures such as the CCITT pictures 4 and 7. 

Next, we simulated the ordering algorithm of Ref. 1. As explained 
there, this algorithm can be illustrated by considering a memory con- 
taining 1728 cells (equal to the number of elements per line). Let the cells 


Table |—State-dependent prediction for CCITT picture 2 (Fig. 1b) 


State S; P(X0/S;) Predicted 

Xt Xe X3 X4 Xo =0 Xo =1 Value Xo 
So 0 0 0 0 1.000 0.000 0 
Si 1 0 0 0 0.300 0.700 1 
Se 0 1 0 0 0.777 0.223 0 
S3 1 1 0 0 0.006 0.994 1 
S4 0 0 1 0 0.822 0.178 0 
Ss 1 0 1 0 0.055 0.945 1 
Se 0 1 1 0 0.323 0.677 1 
S7 1 1 1 0 0.001 0.999 1 
Sg 0 0 0 1 1.000 0.000 0 
So 1 0 0 1 0.690 0.310 0 
Sio 0 1 0 1 0.971 0.029 0 
Su 1 1 0 1 0.154 0.846 1 
S12 0 0 1 1 0.996 0.004 0 
Si3 1 0 1 1 0.200 0.800 1 
Si4 0 1 1 1 0.708 0.292 0 
Sis 1 1 1 1 0.012 0.988 1 
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Table Il—Entropy comparisons for different coding algorithms. The entropy numbers do not include certain housekeeping bits (e.g., line 
sync, color of the beginning run in a line) 


Entropy (bits/pel) 
Coding Algorithm CCITT Image Number 
No. 1 2 3 4 5 6 7 8 
1 One-dimensional run-length coding 0.0505 0.0447 0.0914 0.1652 0.0988 0.0679 0.1791 0.0870 
2 Run-length coding of prediction errors 0.0466 0.0373 0.0693 0.1640 0.0795 0.0482 0.1678 0.0678 
3 Run-length coding of ordered prediction errors; 0.0390 0.0267 0.0571 0.1396 0.0652 0.0366 0.1400 0.0463 


one set of codes, goodness threshold = 0.1. 
Run-length coding of ordered prediction errors; 0.0398 0.0305 0.0592 0.1424 0.0673 0.0392 0.1442 0.0569 

all “white” state = good state 

4 Run-length coding of ordered prediction errors; 0.0356 0.0247 0.0547 0.1287 0.0613 0.0351 0.1284 0.0430 
two sets of codes, goodness threshold = 0.1, 
good—bad boundary run broken 

5 Run-length coding of ordered prediction errors; 0.0351 0.0233 0.0527 0.1282 0.0596 0.0335 0.1274 0.0419 
two sets of codes, goodness threshold = 0.1, 
good—bad boundary bridged 

6 Run-length coding of ordered prediction errors; 0.0338 0.0201 0.0501 0.1320 0.0579 0.0298 0.1326 0.0390 
one set of codes, goodness threshold = 0.1, 
first run dropped 

7 Run-length coding of ordered prediction errors; 0.0324 0.0210 0.0506 0.1239 0.0569 0.0312 0.1250 0.0398 
two sets of codes, goodness threshold = 0.1, 
good—bad boundary bridged; 
last decodable run dropped 


of this memory be numbered from 1 to 1728. We classify the states used 
for predictors into two categories, good or bad. Good states are those for 
which the probability of the prediction being in error, conditioned on 
that state, is less than a given threshold (defined as the goodness 
threshold). All the other states are bad. In the process of ordering, if the 
first element of the present line has a state which is classified as good, 
we put the prediction error corresponding to it in memory cell 1; if, on 
the other hand, the state is classified as bad, we put the prediction error 
in memory cell 1728. We continue in this manner: the prediction error 
for the ith element of the present line is put in the unfilled memory cell 
of the smallest or the largest index, depending on whether the state 
corresponding to the ith element is good or bad. When the memory is 
filled, its cells are read in numerical order and the contents are run-length 
encoded. It is easy to see that the present line can be uniquely recons- 
tructed from the knowledge of the run-lengths of the ordered line, since 
the ordering information is known to the receiver. The efficiency of such 
ordering depends upon the threshold used for classifying the states into 
good or bad. Table II shows two examples, one in which the goodness 
threshold was 0.1 and the other in which only one state (corresponding 
to all four surrounding elements being zero) is classified as good. A 
goodness threshold of 0.1 appears to be acceptable among the many 
thresholds that we used in our simulations. Comparing entropies cor- 
responding to the ordered and unordered prediction errors, we see that 
ordering reduces the entropy by about 15 to 32 percent, depending on 
the picture used. Also, ordering of the prediction errors brings entropies 
down by 15 to 47 percent of the run-length coding of raw data. It should 
be noted that in each of the above cases the predictor was optimized for 
the particular picture. 


2.3 Ordering algorithms with two sets of codes 

Statistics of the run-lengths in the good and bad regions of the ordered 
prediction errors are quite different. As an example, for CCITT picture 
2 (Fig. 1b), 98.5 percent of the pels fall in the good region of which 99.9 
percent are correctly predictable, whereas the bad region contains only 
1.5 percent of the total elements of which 73 percent are correctly pre- 
dictable. Thus, the average run-lengths in the good region are much 
larger than in the bad region. Such a variation in the statistics can be 
exploited by using two different sets of run-length codes for the good and 
bad regions, respectively. The algorithm* would then operate as follows: 
First, we put the ordered prediction errors in the memory as before; then, 
the contents of the memory are run-length coded with one set of codes 
in the good region and a different set of codes in the bad region. 


* This algorithm is related to the one proposed by Preuf (Ref. 5). It is discussed here 
mainly for completeness and was motivated by the communication we received from him 
(Ref. 6). 
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Switching from one set of codes to the other is done at the boundary of 
the good-bad region even though the ordered line may not have a new 
run at the boundary. This process will break the run at the boundary 
between the good and bad region of the ordered line, whereas the or- 
dering technique discussed in Section 2.2 continues the run (whenever 
possible) across the boundary of the good-bad region. This procedure 
is continued until all the runs from the memory are exhausted. 

At the receiver, the coded run-lengths for a complete line are held in 
a memory. Good or bad runs are decoded from the memory as 
needed. 

The results of computer simulations for the ordering scheme with two 
sets of codes are shown in Table II. These results use a goodness threshold 
of 0.1. Comparing the entropies from algorithms with one and two sets 
of codes, it is seen that with two sets of codes about 4 to 8 percent im- 
provement is possible. This is the opposite conclusion* from that given 
in our earlier paper, which used a different source material.! For the 
pictures used in Ref. 1, we had found that ordering schemes with two sets 
of codes resulted in 10 to 18 percent higher entropies than the entropies 
obtainable with one set of codes. This may have been a result of the small 
size of the pictures used for the simulation (an array of 256 256 picture 
elements). 


2.4 Ordering algorithms with two sets of codes and bridging of good-bad 
boundary 


Use of two sets of run-length codes described in the previous subsec- 
tion resulted in the breaking of a run at the boundary of the good-bad 
region since part of the run may be in the good region and the other part 
may be in the bad region. To avoid breaking the run, which extends 
across the boundary, we code the boundary run using the run-length code 
of the good region or the bad region as follows: If the boundary run is first 
required as a bad run in the process of decoding the run-lengths at the 
receiver, it is coded as a bad-run; otherwise, it is coded as a good run. The 
method in which the receiver decodes the bridged run is similar to the 
one given in the next subsection. Results of such a scheme are shown in 
Table II. Bridging of the runacross the boundary results in an improve- 
ment of about 0.39 to 6 percent over nonbridging. As would be expected, 
the percent improvement is smaller for busier pictures. 


2.5 Ordering algorithms with dropped runs 


In most facsimile communication systems a code for the line sync is 
sent at the end of each line of coded data. Since the number of elements 
in a line is fixed, this is redundant. A run can be dropped from each line 


* We thank D. Preuf for showing us data from his simulations which first demonstrated 
this fact. 
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as long as the receiver knows the position of the dropped run. In the or- 
dered line a large benefit can be derived by dropping the first good run, 
since it is generally the longest. This also avoids transmission of run- 
length codes for lines with no prediction errors. Table II shows results 
of the simulation of a scheme in which the first good run from the ordered 
prediction errors is dropped from transmission, and the rest of the runs 
are transmitted by using one set of run-length codes. Dropping the run 
reduces the entropy to between 0.020 and 0.133 bits/pel which is a 5 to 
25 percent reduction compared to the case where all the runs are 
sent. 

It is also possible to drop a run from transmission when two sets of 
codes are used for the run-lengths in the good or bad regions. In this case, 
the first run cannot be dropped since the receiver switches between the 
two sets of codes depending on the past decoded data. However, the last 
run that the receiver needs to decode may be dropped. We have simu- 
lated a scheme in which the good-bad region boundary is bridged and 
the last decodable run is dropped. To explain the scheme, consider a line 
made up of run-lengths of ordered prediction errors as shown in Fig. 3. 
We use two sets of codes and start transmitting codewords corresponding 
to run-lengths G1, Go, ---, Bo, By of the ordered line, appropriately 
switching the code in the good and bad regions. The receiver decodes 
these run-lengths as needed. To bridge the boundary run and drop the 
last decodable run, we use the following rules: 

(i) If there are no runs in the good region, drop the last run in the 
bad region, i.e., Bm. 

(ii) If there are no runs in the bad region, drop the last run in the good 
region, i.e., Gp. 

(iii) If the last two runs required by the receiver in the decoding 

process are G,, and B,, (in either order), drop the runs G,, and 
Bm. This is done independently of the color of prediction errors 
in G, and By. 

(iv) If the last two runs required by the receiver are from the bad 

region and at least one good region run has occurred, then if 

(a) color of B,, isa “1,” bridge G, and B,,, code it using the good 
region code, and drop B,,-1. 

(6) color of By is a “0,” drop Bm. 

(uv) If the last two runs required by the receiver are from the good 

region and at least one bad region run has occurred, then if 

(a) color of G, is a “1,” bridge G, and Bm, code it using a bad 
region code, and drop G,,~. 

(b) color of G, is a “0,” drop Gp. 


Rules (zv) and (v) allow us to drop a run of 0s rather than arun of 1s, 
since runs of Os usually have longer lengths than runs of Is. Also, it is 
possible to bridge the runs at the boundary independent of the color 
change across the boundary of the good and bad region. Thus, the above 
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GOOD REGION BAD REGION 


ORDERED LINE 
Fig. 3—Ordered run-lengths. 


strategy allows dropping a run from transmission, bridging runs across 
the boundary (whenever it is advantageous, even if colors change), and 
the use of two separate sets of codes for the good and bad regions. 

At the receiver, the coded run-lengths are held in memory and decoded 
as needed. A running total of the number of elements from decoded 
run-lengths is kept. If all the run-lengths have been decoded from the 
receiver memory and an additional run is required, this running total 
is subtracted from the total number of elements in a line, and the result 
is taken as the length of the next run. If the result is zero, then the next 
run is taken to be of opposite color, as usual, and decoding proceeds until 
the end of the line. The simulations using the above scheme decreased 
the entropy to between 0.021 and 0.125 bits/pel as shown in Table IT. For 
busy images this scheme does better than the scheme which uses only 
one set of codes and drops the first run. However, for quieter pictures 
the performance is reversed. 


lll. DISCUSSION AND SUMMARY 


We have described in this paper schemes for efficient coding of two- 
level (black and white) facsimile pictures. These were extensions of our 
earlier schemes which ordered the prediction errors before run-length 
coding. The most sophisticated extension presented here results in an 
entropy of between 0.021 and 0.125 bits/pel. Our computer simulations 
indicate that use of two sets of codes for good and bad regions of the 
ordered pictures results in about 4 to 8 percent decrease in entropy 
compared to using only one set of codes; whereas using two sets of codes, 
bridging the good-bad boundary run, and dropping the last decodable 
run decreases the entropy by 11 to 21 percent. 

It should be mentioned that this is not a definitive coding system 
study. We have not considered many important factors crucial to the 
success of any coding system such as the run-length codes and their 
picture dependence and the effect of transmission errors. 
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Free Electron Laser 


by A. HASEGAWA 
(Manuscript received March 6, 1978) 


An introductory guide to the basic mechanisms of the free electron 
laser is presented. The laser gain originates from the stimulated Raman 
or Compton backscattering of a pump electromagnetic field by a rela- 
tivistic electron beam. The condition of optimization of the gain, the 
maximum operation frequency, and the optimum output power are 
obtained in terms of the beam parameters and the magnitude of the 
pump magnetic field. 


1. INTRODUCTION 


Recent observations of amplification of submillimeter! and infrared? 
electromagnetic waves using a relativistic electron beam (REB) have 
created interest in applying the mechanism to produce a high-power, 
tunable laser in the infrared to visible range as well as in speculating the 
possibility of constructing an X-ray laser. 

This paper introduces the basic mechanism of the amplification 
processes and discusses the limitations in the power and frequency re- 
ferring to the presently available REBs. A nonspecialist should be able 
to follow the contents without referring to special references. 

Section II introduces Lorentz transformation of various variables 
between the beam and the laboratory frames, which are used in suc- 
ceeding sections. 

One of the important discussions presented here is the distinction 
between the stimulated Compton and stimulated Raman scattering. 
When the scattering occurs by an excitation of a single particle state, 
uncorrelated free-streaming motion of electrons, it is called the stimu- 
lated Compton scattering; if it occurs by an excitation of plasmon, the 
collective plasma oscillation of the electrons, it is called the stimulated 
Raman scattering. In most cases, the stimulated Compton scattering has 
a gain which is too small to be useful for practical purposes. Hence, the 
limitation in the output frequency is decided by whether or not the rel- 
ativistic electron beam can be operated in the stimulated Raman regime. 
The beam current density and the energy spread is the decisive factor 
for this, as shown in Section III. 


3069 


The 


gain calculations based on classic mechanics are presented for 


both processes in Sections IV and V. The classic calculation is justified 
when the scattered photon density is large so that the photons can be 
regarded as consisting of a continuous fluid. This occurs when the 
number of photons in a box of its wavelength (A) cubed is much larger 
than unity; that is, when \8P/(Awc) >> 1, where P is the electromagnetic 
power and c is the speed of light. 

Some design examples using presently available REBs are shown in 
Section IV. MKS units are used throughout this paper. Definitions of 
the notations and subscripts used are listed below. 


z: coordinate taken in the direction of the beam velocity. 
x,y: coordinates perpendicular to the beam velocity 
m: electron rest mass 
Dp: momentum 
P: 
vo 
Vg 
E 
B 
€ 


power 


: beam velocity 
: group velocity 
: electric field intensity 


: magnetic flux density 
: speed of light, 3 X 108 m/s 


y: (1 — vg/e2)-¥/ [eq. (5)] 


Ho: 
Yo: 
Wp: 


Ro: 


Wo: 

€0: 

UT: 
Ay/y: 
r: 


wW 


~, 


k;: 


Ret 


W): 


—_ 


kj 
By 
Rp 


Wer 
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beam energy 

Ho/mc? [eq. (21)] 

plasma angular frequency, frame invariant 

2m/Xo (Ao is the periodicity of the helical winding of the pump 
magnetic field, Fig. 1) 

koc 

space dielectric constant, 8.854 X 107!2 F/m 

thermal speed in the beam frame [eq (17) and (35)] 
relative energy spread of the beam in the laboratory frame 
temporal gain 

: incident electromagnetic wave angular frequency, which 
corresponds to the pump frequency in the beam frame 
incident wavenumber, beam frame 

scattered electromagnetic wave angular frequency, beam 
frame 

scattered wavenumber, beam frame 

longitudinal electrostatic wave angular frequency, beam 
frame 

: longitudinal wavenumber, beam frame 

: transverse pump magnetic field, laboratory frame 

: Debye wavenumber w,/ur in the beam frame 

: angular frequency of transition from stimulated Raman to 
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stimulated Compton scattering in the laboratory frame [eq. 
(37)] 
Jo: beam current density 
v;: amplitude of oscillating velocity of electrons due to the 
incident (pump) wave [eq. (24)] 
Subscript L: quantities in the laboratory frame 
Subscript B: quantities in the beam frame 
Subscript 1: longitudinal wave, beam frame 
Subscript s: scattered wave, beam frame 
Subscript 1: incident wave, beam frame 
Subscript 1: component perpendicular to z. 


ll. LORENTZ TRANSFORMATIONS 


To understand the dynamics of the REB, we must first refresh our 
memory of the Lorentz transformations which are relevant to our 
problem. If we take z axis in the direction of the beam velocity as in 
Fig.l and usesubscriptsLandB to represent thelaboratory andthe beam 
frame, the Lorentz transformations of the coordinate z and time t for 
a REB with the velocity vo are given by (for example, see Ref. 3): 


2B = y(2L — Votz) (1) 
or 
ZL = y(zp + vots), (2) 
and 
Vo 
tRp=Y (t at): (3) 
c . 


ELECTROSTATIC WAVE, Wy, ky ‘; 
Zz i 
< 2m/k, 





Fig. 1—Schematic diagram of a free electron laser which utilizes the helical magnetic 
pump field. The helical current produces a periodic magnetic field which induces longi- 
tudinal electrostatic oscillations in the beam. A nonlinear interaction between the induced 
longitudinal oscillation and the periodic pump field produces an electromagnetic wave 
which propagates in the direction of the beam. This process can be viewed, in the beam 
frame, as a stimulated backscattering of the pump field by the electrons in the beam. Since 
the scattered wave propagates at the same speed as the beam itself, the beam length, Lp, 
can be asize of several wavelengths in the beam frame. However, the length of the helical 
field, Ly, should be such that enough e-folding gain‘can be obtained. The minimum 
e- folding distance is obtained in eq.(91). L;, should therefore be much larger than L,,in this 
equation. Typically, L,, is on the order of 1 m. 
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or 


v 
tL=yY (ts +28), (4) 
where 
2. —1/2 
y= (1 - =) (5) 
co 


and c is the speed of light. Similarly, the electric field intensity E and 
the magnetic flux density B are transformed to 


Ep, = Er, (6) 
Ep. = y(Ex. + vo X Bz), (7) 
and 
Bpz = Biz, (8) 
1 
Bai =yY (Bis -jvoX Ez), (9) 


where subscript 1 shows the component perpendicular to the beam 
velocity. Equation (7) indicates that a transverse magnetic field which 
is static but spacially periodic in the z direction with the periodicity 27/ko 
creates an oscillating electric field in the beam frame with the frequency 
given by ykovo. Transformations of velocities are obtained by taking the 
derivatives of (2) and (4), 


F UBz + Uo 
Lz = . 
1 + vovp-/c? 


The beam has transverse velocity modulation due to the vo X By , Lo- 
rentz force. The Lorentz transformation becomes 


(10) 


(1 + vovg,/c?) 


1 
= URW (11) 
Y 


ULA 


The Lorentz transformations for frequency and the wave number are 
obtained by considering the phase factor kpz, + wytz of a wave in the 
laboratory frame, exp i(kz + wt); we take a wave propagating against the 
beam direction to consider the back scattering. 


kpzy + witty = ¥ (Fu + a) zpt+ yw, + kpvo)ta; — (12) 
c 


hence 


kp=y (A + Fen), (13) 
c 
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wp = y(wr + kyvo). (14) 


One important aspect of this result is that the frequency seen by the 
beam is y times the laboratory frequency wz plus y times the Doppler 
shifted laboratory frequency Ryvo. An electromagnetic pump wave 
propagating against the beam direction (whose dispersion relation is 
given by w = kc) has a frequency given by y(w + Rug) ~ 2yw when ob- 
served in the beam frame. Similarly, the frequency w, of the back-scat- 
tered light which faces little frequency shift from the incident light in 
the beam frame becomes 2yw, when observed in the laboratory frame. 
Hence, the frequency of the back-scattered light in the laboratory frame 
is given approximately by 4y2 times the incident (pump) frequency in 
the laboratory frame. 

The pump frequency can be de when a periodic magnetic field is used. 
In this case, the frequency of the scattered wave is given by 2y7kou9, 
where ko is the wave number of the periodicity \o, Ro = 27/Xo, of the 
magnetic field (see Fig. 1). 

In addition to these quantities, we need the transformation of the 
plasma frequency, wy, the beam thermal speed v7, the beam oscillating 
velocity in the transverse direction due to the pump field v ,, and the 
growth rate I. 

Since the Lorentz contraction increases the density by y and the mass 
also by a factor y, the plasma frequency, w, (= e2n/egm)!/? (where ¢, is 
the electron change, n the beam density, and ¢« the space dielectric 
constant), is frame invariant. 

The thermal speed in the beam framé v7 can be expressed in terms 
of the energy spread of the beam in the laboratory frame as follows. From 
the definition of y in (5), 


it 

ve = c? ( — ae): (5’) 
Y 

Hence the velocity spread 6uvo in the laboratory frame is expressed in 


terms of the spread in y, 


A 
609 = Cc — : (15) 
Y 
Now if we use the Lorentz transformation of vz, (10), 
Avpz 
6U9 = 60,2 = — 
, “(1 + vovp2/¢?) 
1 aE 
om 72 Avg = 2 (16) 


because ug, = 0. Hence from (16) the thermal speed in the beam frame 
is obtained: 


AY 


vrT=c (17) 
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Next, we obtain the oscillating transverse velocity in the beam frame. 
We consider the example of periodic magnetic pump. In this case, the 
beam kinetic energy Ho does not change due to the presence of the pump. 
If we introduce yo to represent the total kinetic energy of the beam, 


Ho = e(pj, + mc?) 
= me*yo, (18) 


where py, is the momentum in the laboratory frame (Ho is not frame 
invariant, but we delete subscript L for this quantity). The velocity 
components in the transverse and z directions are obtained in terms of 
PL as 





oH. 1 
vy = = —— pr (19) 
OPLi MYo 
1 
ULz = Vo = my Pry (20) 
0 


If we substitute (19) and (20) into (17), we can obtain the relation be- 
tween Yo and ¥ as defined in (5), 


v2 
= (1 + are (21) 


This expression shows that y can be significantly different from yo even 
if v7 , /c? « 1. With these preparations, we can now obtain vg , in terms 
of the pump magnetic field. The equation of motion of an electron in the 
presence of a transverse helical pump magnetic field B, (B, cos koz, 
B, sin koz, 0) is given by 


apis _ ae: dv1 
dt © dt 


since Yo is constant. If we assume Ug > Uz |, 2 = Vot, (22) can be imme- 
diately integrated to give 





+= —e(vo X B,), (22) 





B B 

UL, = ( <= + Cog (Rovot), A+ sin (kovot), 0). (23) 
myoko mvyoko 

As will be seen, we need only the magnitude of the oscillating velocity 

in the beam frame, |vg, |, which may be obtained from (23) and (11), 


yeB , 
Jub. | = (= 0;). 
yomko 


This gives the relation between the oscillation amplitude of the electrons 
in the beam frame and the pump magnetic field in the laboratory 
frame. 

We now consider the transformation of the growth rate I. If a wave 
with slowly varying amplitude Ag(zg,tg) grows in time and space at a 


(24) 
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temporal growth rate I'g in the beam frame, Ag satisfies the following 
equation 

oAp OAR 

— + v = [pAp, 25 

Sp. Te = BAB (25) 

where upg is the group velocity in the beam frame. If we use (2) and (4), 
0/Otz and 0/dzz can be expressed in terms of derivatives in the labo- 
ratory frame. 


a One = i) ee (v9 + One) 5 (26) 
If we substitute (26) into (25), we see 
OABz # Uo + Upg OAR _ I'zp (27) 
Ot, 1+ vpgvo/c? Oz, ~ (1 + ele 
The amplitude in the laboratory frame is linearly proportional to Ag. 


Hence (27) gives the Lorentz transformation of the group velocity as well 
as the growth rate, i.e., 


we UO PUB 5 Nips 98 

te 4 Evpgvgle2 2 (v0 + UBg), (28) 
T T 

Ty, = ——__2—_ ~ -8. (29) 


y(1 + upgvo/c?) 27 


lil. STIMULATED COMPTON OR STIMULATED RAMAN SCATTERING? 


We consider here the basic processes of the stimulated scattering in 
the beam frame. If we designate the frequency and wave number of the 
incident (pump) wave by w; and k; and those of the scattered (amplified) 
wave by w, and k;, the frequency and wave number of the longitudinal 
oscillation excited in the beam (which is a stationary electron plasma 
in the beam frame) are given by 


W] = Wj — Ws, (30) 

ky > k; k,. (31) 

We note here that the incident and scattered waves are electromagnetic 

waves, hence w;/k; ~ w;/k, ~ c, while the longitudinal wave in the elec- 

tron plasma has a phase velocity, w)/k), much smaller than the speed of 
light. 

To consider the backscattering, which is needed to utilize the fre- 

quency up conversion as discussed in Section II, as well as to maximize 


the gain, we must take k,-k; = —|k,||k;|. The incident wave propagates 
against the beam direction, hence k; = —|k;|2. Thus [{k,| = |k,| + 
[Ril 


Now the longitudinal mode in the electron beam has the plasma dis- 
persion relation given by 
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w? ta Ofo/dv 
ee ge a LR ee 
Be Joh oO 82) 


where fo(v) is the velocity distribution function of the beam electrons 
in the beam frame and is assumed to be nonrelativistic. If we solve (32) 
for w, we have 


w & Wp ifk; « Rp, (33) 
w ~ kyl — i0(1)] ifk; > kp, (34) 


vp = | Sf. v%fode |" (35) 


is the thermal speed of the electrons and kp = wp/ur is the Debye wave 
number, both in the beam frame. Equations (33) and (34) indicate that 
if the wave number is larger than the Debye wave number, the collective 
property of the plasms oscillation is lost. The large imaginary part in (34) 
is the consequence of the Landau damping. 

Now the dispersion relation of the electromagnetic wave is given by 


w2 = c2k2+ we. (36) 


If we use the dispersion relations for w; and w, [which satisfies (36)]| and 
« [which satisfies (32)], the resonant conditions, Eqs. (31) and (82), can 
be plotted in (w,k) diagram. For the case of backscattering, the plots are 
shown in Fig. 2 (for the case of k} « kp) and Fig. 3 (for the case of k, > 
kp). In these figures, the arrows show the direction in which the state 
with energy hw; and momentum hk; decays into two other states with 
energy hws, and hw, and momentum hk, and hk). The decay process 
shown in Fig. 2 describes the stimulated Raman scattering and that in 
Fig. 3 the stimulated Compton scattering. 

Both figures show backscattering because k; and k, have opposite 
signs. We see from these figures that if ws >> wp, |ki| ~ 2|ks|. Hence for 
a given quality of a beam if w,(= k,c) is increased, k; which may be ini- 
tially smaller than kp becomes larger than kp at some value of w,. Hence, 
there exists a critical frequency of the scattered wave (which corresponds 
to the lasing frequency in the beam frame) above (below) which scat- 
tering process becomes Compton (Raman). If we write this critical an- 
gular frequency in the laboratory frame as w,,, that is, the actual lasing 
frequency, w,., can be expressed in terms of the beam quality. Using 


where 


Wer = 2YWs 
ws = CRs 
hk, = 2k, = kp ’ 
we have, with eq. (17), 
Wer = yvkpe 
= Ywp(y/Ay). (37) 
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Fig. 2—Dispersion diagram of the electromagnetic wave and plasma wave in the beam 
frame. This diagram shows the stimulated Raman scattering process. The arrow indicates 
the direction of decay of the incident wave with frequency and wave number given by w;, 
k; into a longitudinal oscillation with frequency wp and wavenumber hk) and a backscattered 
electromagnetic wave with frequency w, and wavenumber ks. 


Thus the critical frequency depends on the relative spread of the beam 
energy observed in the laboratory frame, Ay/y, as well as the beam 
density and y. Since the plasma frequency is frame-invariant, it may be 
expressed in terms of the current density Jo of the beam. Equation (37) 
then becomes 

Wer = 8.14 X 108 y(y/Ay)Jo!. (37’) 
Since MKS units are used, Jo is in the unit of A/m2. This expression is 
an important criterion in designing the laser, because at w > w,, it should 
operate in the stimulated Compton regime and the growth rate becomes 
pessimistically small. Fora practical purpose,+w = we, is the high- 
frequency limitation of a free electron laser. 


IV. THE STIMULATED RAMAN SCATTERING 


In this section, we derive the growth rate in the stimulated Raman 
regime. A number of authors have derived the growth rate using different 
methods. The classic mechanical calculation is much simpler than the 
quantum mechanical one and is well justified for a stimulated process 
because a large number of photons are produced at a very early stage of 
the process. Tytovich’s book‘ and a review paper by Kaw et al. are some 
of the appropriate references on this subject. 
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Fig. 3—The dispersion diagram that shows the stimulated Compton scattering process. 
When the wave number of the induced longitudinal oscillation k; is larger than the Debye 
wave number kp, the induced longitudinal oscillation in the beam electrons becomes un- 
correlated. In this case, the scattering occurs by the sum of Compton scattering by indi- 
vidual electrons. Since the induced wave number k; is proportional to the lasing frequency, 
when the lasing frequency is increased, the scattering process changes from the stimulated 
Raman to the stimulated Compton. 


Attempts have been made to obtain the gain in the laboratory frame 
using a rather complicated nonlinear relativistic dynamics.©’ As has been 
shown, the gain and all the other parameters can be Lorentz-transformed 
into the laboratory frame, it is much simpler to do the nonrelativistic 
calculation in the beam frame. Thus we do the analysis in the beam 
frame. Referring to Fig. 2, we consider a large amplitude incident wave 
propagating in the negative z-direction with transverse electric field 
given by 

ReE; exp 1(kjz + at), (38) 


where k; and w; are positive. E; is related to the pump field in the labo- 
ratory frame through the Lorentz transformation shown in eq. (7). In 
particular, if the static periodic magnetic field is used, E; is given by 

[|£;| = yuoB, ~ ycB_, (39) 
where By is the amplitude of the rippled or helical magnetic field in the 
direction perpendicular to the beam. 

To simplify the analysis, we assume the variation of E; and all the 
other field quantities in the transverse direction is negligible. This 
assumption may be justified if the beam diameter is much larger than 
all the wavelengths involved. 

To obtain the growth rate, we consider a test electromagnetic wave 
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(the scattered wave) which propagates in the direction of the beam and 
which is excited by a nonlinear current density produced by the product 
of the incident field and the induced longitudinal density perturbation 
in the beam. 

From the Maxwell equation, the electric field of the scattered wave 
E, satisfies the wave equation 


2 
vp, — LOE: _ | Os 
c 








(40) 


where the current density consists of the linear (self-consistent) portion, 
J”, and the nonlinear portion J*”, which is produced by the incident 
field, 


Jg= J+ JM, (41) 
where 
J’ = —enovs (42) 
and 
JN" = -enyv;. (43) 
v, is the electron velocity modulation due to the scattered field 
“== §,, (44) 


while v; is the modulation due to the pump field. In the case of a helical 
field pump, v; is given by eq. (24), 

B 
yomko 


and n, is the density modulation due to the induced longitudinal oscil- 
lation in the beam, which satisfies the continuity equation, 


on) 
——+V- = 0, 46 
ai (novi) (46) 
with 
dv e 
—= —— —). 47 
r me (47) 


FE, is the electric field of the longitudinal oscillation. 

If we Fourier-transform (43), J‘” contains two frequency components, 
one the Stokes mode, w; — w and the other the anti-Stokes mode, w; + 
w, where w is the frequency of the induced longitudinal oscillation. To 
obtain the growth rate due to the stimulated Raman scattering, we need 
to retain only the Stokes mode. (We discuss the effect of anti-Stokes 
mode later.) If we Fourier-transform eqs. (40) to (44), retaining only the 
Stokes mode, we have 
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2 — al a «)| E, = —i(w; — )poenivi, (48) 


where * shows the complex conjugate. 
If we express 7, in terms of Ej, using Eqs. (46) and (47), 


k,-E 
nN = No “SU ; (49) 
m iw 
eq. (48) becomes 
D,; (ks, w; = w)E, = Ws (ky 7 E})v;, (50) 
where 
D,(k,w) = k2e2 + we — w2, (51) 


and w) ~ wp is used in evaluating the right-hand side of (50). D, = 0 gives 
the linear dispersion relation of the scattered electromagnetic wave. 
Equation (50) shows that the dispersion relation is modified by the in- 
cident electromagnetic wave and the induced longitudinal wave. 

To close the equations, we now must express EF) in terms of E, and vj. 
The set of equations that describe the longitudinal mode are Poisson’s 
equation, 

Vv: EF) et ae: (52) 
€0 
and the continuity equation (46), both of which are linear, and the 
equation of motion, 


GNI 8 ee Boga BD. (47’) 
dt m 


The continuity equation is linear because the electromagnetic wave is 
incompressible, n, = n; = 0. This means that the current density for the 
longitudinal mode is given by —enov). Hence, the only nonlinearity comes 
from the Lorentz force, v X B, in eq. (47’). Note that we dropped the 
corresponding nonlinear term in the calculation of J‘” because it is 
smaller than the term retained by the factor of v;/c. Also note that we 
used the linear relation, eq. (47), to express n) to evaluate the coupling 
term nv; of (50) because it was a higher order correction there. If we use 
the Maxwell equation, 

wB=kxXE, (53) 
the nonlinear terms in (47’) become 


(v; X B,; + v, X B;) 
k, X E; « _, k; X E; 
= (v area a Rs) 
Ws Wj 


2G. oR ee) 


Ss 


=-— (vi- E;)ki, (54) 


Ss 
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where we used k,-v; = 0. Hence the total longitudinal velocity modula- 
tion is given by 


* 





1 ae i 
‘ec (hie * ka). (55) 
lw Mm Ws 
If we use this expression in (46) and (52), we have 
;-E; 
Di(ky,w)ky - Ey = —kP ==, (56) 
Ws 
where 
a2 
Dy(k,w) = 1-*2, (57) 
(€9) 


and D, = 0 gives the linear dispersion relation for the longitudinal mode. 
Noting that E, is parallel to v; in eq. (50), eqs. (50) and (56) present the 
set of coupled equations between the scattered wave and the induced 
longitudinal wave, 


D; E, = ws (ky . E))vi, (50) 
2 Bw? 
Diky - Ey = —kP 2 , (56) 
Ws W 


through the velocity modulation by the incident wave v;. The dispersion 
relation of the coupled system is given by eliminating k-E, and E, from 
these equations, 


2 
D,(Rs,t0; — @)Di (ky) + “8 kPv? = 0. (58) 
w 


If k\v; is much smaller than w), eq. (58) may be solved for a small fre- 
quency deviation Aw from the frequency given by the linear dispersion 
relation by expanding D, and Dy as 


oD, 








D,(Rs,0; — wo) = Dy(ks,ws) + 5 Aw = 0+ 2, Aw, (59) 
WwW 
Rs, Ws 
while 

oD 
Dy(ki,w) = Dy(ky,o) + ea Aw = 2Aw/wp. (60) 

(6) 

Rj, 





Substituting (59) and (60) into (58), we have 


1/2 
Aw = +" |ky;| ) (61) 
2 Ws 
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The imaginary part in Aw gives the Raman growth rate in the beam frame 
rs hence 


2 
In the case of the periodic magnetic pump, v; is related to B, through 
eq. (45). The growth rate in this case is then given by 


20B 1/2 
pgs ee (63) 
yom Ws 
The gain in the laboratory frame is simply given by 'p/2y as shown in 

eq. (29). 
We note here that the ratio y?/yo can be expressed in term of - through 
(21), 


1 1/2 
rk == |kw;| - (=) (62) 


7? Yo 

yo 1+ yovzs/e? si 
This expression indicates that a level exists in the velocity modulation 
vy ,, or the pump strength B , , that produces a maximum growth rate. 
This is because an excessively large modulation deflects the beam too 
much in the transverse direction, which results in reducing the value of 
y. There are different ways by which the growth rate can be optimized 
depending on the choice of fixed quantities. In any case, the maximum 
growth is achieved by selecting 


vi v5/c? ~1, 
or in terms of the modulation magnetic field, 


pa ee ee | (65) 
m_ koc mM wo 


When the pump intensity is large such that the growth rate I'g be- 
comes larger than the plasma frequency, that is, if 


| Ryv; | > (wpws)'/, (66) 
the longitudinal mode loses its linear property. In this regime, the growth 


rate should be obtained from (58) without expanding D)(kj,w) around 
ki,p.® The growth rate is then modified to 


rg = [sbty'® es 
Wj 
This regime is often called the oscillating two-stream instability 
(OTSI).? 

If the pump amplitude is further increased, we should include the 
effect of the anti-Stokes mode which is simultaneously coupled in. The 
dispersion relation including the anti-Stokes mode becomes 


2.2 9 
1 Sl =0, (68) 
Dy(ky,w)w? Dg (Rs, 0 = w) Ds(k3 Oj + w) 
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where k} is the wave number of the scattered anti-Stokes mode. The 
growth rate in this regime is shown to be proportional to v?, and it cor- 
responds to the modulation instability (for example, see Ref. 10) of the 
pump wave. 


V. STIMULATED COMPTON SCATTERING 

Here we obtain the gain in the stimulated Compton regime. As was 
discussed in Section III, if the wave number of the longitudinal oscillation 
induced in the beam electrons is larger than the Debye wave number, 
Rp(= wp/v7r), the collective nature of the longitudinal mode is lost. The 
scattering then occurs by the individual electrons. 

Because distribution of velocities exists in the beam electrons, to ob- 
tain the total scattering gain we must average over the velocity distri- 
bution. If we look at Fig. 3, we see that the resonant condition of the 
stimulated Compton scattering in the beam frame is given by 

Wi — Ws = [Rlvr, (69) 
[kil + |Rs| = | Ail. (70) 

As we have seen in the case of the stimulated Raman scattering, we 
must obtain J” to calculate the effect of the pump on the scattered 
mode in (40). In the present case, the Fourier amplitude of J’” is again 
given by 

IN" = —enyj; (71) 
however, the calculation of nj is more complicated because of the aver- 
aging over the velocity distribution. 

To obtain nj, we use the Vlasov equation, which includes the nonlinear 
force term produced by the v X B force as seen previously. 

NL 

ft, fi, FR fo _ 

ot Oz m OQv;z 
where f/f and fo are the perturbed (which represents the induced density 
modulation) and unperturbed velocity distribution function of electrons 
in the beam frame, v, is the z component of velocity, and F/” is the 
nonlinear force acting upon electrons at the frequency w = a, 

FRE = —e(v; X B, + vs X B;). (73) 

In (72), the linear force produced by the self-field, e£/m, is ignored 
because the induced longitudinal field is nonresonant; that is, D)(k},a)) 
~ 0, due to the heavy Landau damping, and hence its amplitude is small. 
If we Fourier-transform eqs. (72) and (73) and take only the Stokes term, 
we have 


0, (72) 


; = Ofo/Ovz e ky 

‘i(k; — w) mus 
The induced charge density n is then obtained by integrating this ex- 
pression over Uz, 


v;-E,. (74) 
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*x 
. k? €oVi * E, 


en) =1——_—— x, (75) 
Ws 
where xj is the susceptibility of an electron gas, 
wp Ofo/dvz 


x= - duz. (76) 


kp Jvu-(w + i0)/ky 
The dispersion relation for the scattered wave is now obtained by sub- 
stituting (75) for the expression for the nonlinear current density, (71), 
and using it in the wave equation for the scattered electric field, (48). 


[(w; — w)? — w5 — ck? JE, 
= |v; | 2xf RfEs. (77) 
If we solve for w ~ w; — ws + Aw, we have 
Aw = —L RPI v;]2. (78) 
2W5 


The temporal growth rate is obtained from the imaginary part of xj. 
From Eq. (76), we see 
ofo 


2 
Im xj = at ic — w/|k\|) Sie 


If we take the Maxwellian velocity distribution for fo in the beam 
frame, 





duz. (79) 


1 2 2 
= ——— e707/207, 80 
fo V2Q0 UT oo 
2 
J = 3 ps. be ~ 0.76 Sep, 79/ 
a kjv} * e kfvi, 2) 
The Compton growth rate I'f is now obtained from (78) and (79’), 

w |vil? 

rp = 0.4—-—,-. (81) 
Ws UT 


If we compare the Compton growth rate '§ with the Raman growth rate, 
(62), we see a qualitative difference. The Compton growth rate is pro- 
portional to the pump amplitude squared, while the Raman growth rate 
is proportional to the pump amplitude itself. 

If the pump amplitude is increased such that v; > v7, it has been 
shown by Hasegawa et al.!! that the pump field effectively increases the 
velocity spread by v; X B; force and thus decreased the gain. The proof 
was made for an electromagnetic wave pump, but it is believed that even 
when the helical magnetic pump is used, the similar effect appears when 
the beam enters into the magnetic field and suddenly see the magnetic 
field pressure, B?/2u9. The Compton gain for such a case becomes?! 


2 vy. 3/2 
rg ~ 0.3 227 ie ) . 82 
? Ws C \UT ( 
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One important remark should be made here. We obtained Raman and 
Compton gains by taking the asymptotic limits of k} « kp and ki > kp, 
respectively, to have simple analytic expressions. However, this does not 
mean that the gain at the transition regime cannot be obtained, nor that 
an abrupt transition exists between the two regimes. In fact, the unified 
dispersion relation which covers the entire regime can be obtained by 
using the Vlasov equation and by simply including the self-consistent 
electric field E) in (72). If we further allow a situation that the scattered 
wave may not propagate in the beam direction, the unified dispersion 
relation which is expressed in the form of eq. (68) becomes 


__kixt (ke) | |k, X vi |? [ky x v;|? | =0. (83) 
1+ xi (ki,w) Lk? Dy (ks,w; —w) RFPD(RT,w; + @) 


The gain for the entire regime is obtained by numerically solving this 
equation for w. 


VI. LIMITING GAIN AND OUTPUT POWER 


In the previous two sections, temporal growth rates for stimulated 
Raman and stimulated Raman scatterings were obtained. We summarize 
the result in the following, by using k; ~ 2|k;| ~ 2w,/c, and ws ~ ;j. 
Raman gain (beam frame) 


. - 1/2 
rgelil oye, ici, S) (84) 
c Cc Wi 
.|2 ; 1/2 
Tg = (28 vi wai) vs, jg lvl, (“) (85) 
c Cc Ww; 
Compton gain (beam frame) 
2/2532 : 
poe apteey (86) 
UT WW UT 
lv;| w2 7 ¢ \3/2 Kei 
rg = 0.3 vi Se j) if. (87) 
C wi \UT UT 


The gain in all cases depends on the pump intensity v;. If one uses the 
helical magnetic pump, as we have shown in Section VI, an optimum 
value exists in the pump magnetic field B , , which is given by eq. (64). 
The corresponding velocity v; becomes |v;|/c ~ 1/2. If we use this 
value, the Raman (OTSI) and Compton gains become 


Tmax & (wpwi)'/8, applicable for w; « ie ns (88) 


Y 3/2 w2 : Y 
T'S max & 0.2 a, =p licable for #; > —— wp. (89 
B me Pe applicable for w; re Wp (89) 


Here w; = 2Ay/y wp corresponds to the critical frequency, eq. (37) be- 
tween the two regimes, that is the incident frequency for k; = kp. 
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We see that the growth rate increases gradually as w; is increased and 
then decreases in proportion to w; 1. If we take an example of a best 
quality beam with Ay ~ 10-3 y, T'§ max at the critical frequency is given 
approximately by 


4 Y 1/2 
T'S max & 0.4 wp Ga =~ 12 wp. 


On the other hand, at the same frequency, 


1/3 
rg max ~ Wp Gre, ~ 7.8 Wp. 

This indicates that, at the critical frequency, the Raman and Compton 
gains are approximately the same. If we now express the plasma fre- 
quency in terms of the beam current density Jo, wp = 8.14 X 10° VJ, 
hence the maximum growth rate in the beam frame is approximately 
given by ['g max ~ 10 wp ~ 108 VJ .Asan example, if we take a nominal 
parameter of “microtron” !2 beam with a current of 1 A with the cross 
section of 1 mm?, J = 10° A/m?2. Thus, 'g max ~ 10!! sec7!. We also note 
that the gain in the laboratory frame Iz, is given by I'g/2y. For a nominal 
value of y = 103, the laboratory frame gain is 5 X 108 sec—!. Hence the 
e-folding distance L = c/', ~ 1m. The e-folding distance at a lower 
frequency becomes shorter in proportion to w; 8, while at a higher fre- 
quency becomes longer in proportion to «;. 

These arguments may be summarized as follows. If we define the 
critical frequency given by (37) as the limiting frequency that the free 
electron laser can operate, the minimum e-folding distance in the labo- 
ratory frame L,, and w,, can be expressed in terms of Jo, y and y/Ay. 

The maximum lasing frequency, f¢,: 














fer = = 1.3 X 108 y Ge) [Jo(A/m2)]1/2 Hz. (90) 
2 Ay 
The minimum e-folding distance, L»: 
Lm = —— = 93 () oY Fe NifeoP\ 12 vi, (91) 
Ty max Y 
Condition to achieve L,,: 
eB = _ Mer 
= koc = by? 
or 
B, (W/m?) = 1.8 X 10714 me (92) 


Note that the beam pulse length (Fig. 1) is not a crucial parameter so long 
as it is longer than, say, 10 kj ' because it runs at the same speed as the 
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scattered light. If we take again the previous examples of microtron,!? 
Jo = 106 A/m?2, y = 102, and y/Ay = 10°, we have 
fer = 1.3 X 1014 Hz 
Lm = 0.93 m 
By = 2.3 X 10-! W/m? 
Qa/2 
No = 2a/ko = = 4.6 x 10-2 m. 

Tee 

Let us now discuss the maximum output power of the laser. Because 
Lm is on the order of 1 m, it takes a relatively long system to achieve the 
saturation in gain. But let us assume that the system is infinitely long 
and ask ourselves what causes the saturation of the gain. 

As we have found, when the energy spread of the beam becomes large 
so that k; < kp, the gain drops in proportion to w; !. When the scattered 
power is increased, it produces a larger v X B(= v; X B,) force which 
traps the beam electrons and increases its energy spread. The trapping 
potential ¢; due to the Lorentz force v; X B, in the beam frame is ob- 
tained from 


| 
St! = |p| ~ |v,B: 
ae] = Pavol = o.B5 
or 
1 
be = 7 [vil |Bsl. (93) 
1 


The effective thermal speed vers produced by the trapping potential 
pt is 





a) a (94) 


UTeff = ( 
m 

We can consider that the saturation occurs when kj ~ wp/U Terr because 
if Upere is made larger than this critical value, the gain changes from 
Raman to Compton. Hence, the maximum amplitude of the magnetic 
field of the scattered wave is given by 
Fa ec) Sk RR od | eee (95) 

(2e¢:/m)¥/2  (2e|v;| | Bs |/kim)1? 
or by solving B, using |v;| ~ c, we have 


ky 


2 2 
UE On a 


: 96 
eck; e w; 6) 


Ss 


If we operate at the maximum gain, w; = we,/2y = wp (y/Ay)/2. Hence, 
we must use as the maximum scattered field 
A 
B, = 2 w, —, (97) 
e Y 


FREE ELECTRON LASER 3087 


and the corresponding electric field is 
E, = cBs. (98) 


If we Lorentz-transform these fields to the laboratory frame according 
to eqs. (5) and (7), we have 


Brs = 2yB, 
and 

Eys = 2yEs. (99) 
Hence, the maximum output power P,, is given by 


Pm = ExsBris/bo 


= 1672 Can nD (=) co 
e - Y Ho 


Avy 2 
= 16 ee. PBeam; (100) 
Y 
where Pgeam is the beam kinetic power density, 
PBeam = mc#?yn. (101) 


Equation (100) shows that the conversion efficiency is roughly given by 
16(Ay/y)?. This may be misleading, because it shows that the poorer 
quality beam gives better efficiency. This comes from the dependency 
of B, on w;! so that the lower the frequency the longer the saturation 
field. When a poor quality beam is used, the efficiency may become 
better but with a sacrifice of lowering the laser frequency. 

If we use the same example of parameters, y = 102, Ay/y = 107? and 
1 A beam, the maximum output power of the laser becomes 800 W. 


Vil. CONCLUSION 


Use of stimulated backscattering of a pump field by a relativistic 
electron beam for a tunable laser was discussed. The temporal gain and 
the e-folding distance in the laboratory frame are obtained for both 
stimulated Raman and stimulated Compton scattering regimes. It is 
shown that in the stimulated Compton regime, the gain drops in pro- 
portion to the lasing frequency hence is not a practical regime to deploy. 
If we consider that the transition frequency from the Raman to the 
Compton regime is the maximum lasing frequency, the lasing frequency 
can be obtained as a function of the beam energy y, the relative energy 
spread of the beam Ay/y, and the current density Jo as shown in (90). 
The e-folding distance corresponding to this frequency is shown in eq. 
(91). For a nominal value of the available relativistic electron beam, these 
quantities become approximately 10!4 Hz and 1 m. The maximum power 
output corresponding to this operation condition is also obtained and 
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shown to be given by (100). Again for the nominal value of the beam 
parameter, the output laser power becomes about one kilowatt. These 
results indicate that the use of a relativistic beam with y of 100 and Ay/y 
of 10-8 can produce a tunable laser with an optimum operating frequency 
approaching to the visible. However, extending this process into X-ray 
regime seems extremely difficult. 
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